1 Introduction

This paper is a direct continuation of [10], where we investigated the normal approximation
of random variables belonging to a fixed sum of Poisson Wiener chaoses, with special emphasis
on $U$-statistics living on the support of a Poisson random measure. As we will see below, strong
motivations come from a fundamental paper by Reitzner and Schulte [25], where a first connection
between Malliavin operators and limit theorems in stochastic geometry was established,
and several bounds were obtained via an extensive use of product formulae for multiple integrals
(see e.g. [21, Chapter 6]) - with applications e.g. to statistics based on random graphs, or to
the Gaussian fluctuations of the intrinsic volumes of $k$-flats intersections of a convex body.

As discussed in more detail in Section 2, the main contribution of [10] was the derivation of
general upper bounds, expressed in terms of contraction operators, on the Wasserstein distance
between the law of a finitely chaotic random variable and a centered Gaussian distribution. As
further demonstrated by the examples developed in this paper, we believe that contractions are
the most natural object for dealing with the normal approximation of random variables having
a finite chaotic decomposition (like $U$-statistics): for instance, our bounds yield conditions for
central limit theorems that are in many instances necessary and sufficient, and automatically
imply joint CLTs (with bounds) for chaotic components of different orders.

Remark 1.1. As a complement to the analysis developed in [10], one should note that bounds
based on contraction operators (such as the one appearing in formula (2.7) below) are
considerably simpler than those obtained by developing expectations of Malliavin operators via
diagram-type multiplication formulae (such as the one stated e.g. in [21, Theorem 6.1.1]). One
combinatorial reason for this phenomenon is that, in the jargon of diagram formulae (see [21,
Chapter 6]), computing norms of contractions requires one to assess integrals labeled by
partitions having blocks of size either two or four that can be arranged in a circular way (see [21, p.
49] for definitions and illustrations), whereas bounds based on diagram computations contain
integrals associated with general noncircular partitions having possibly blocks of size 3.

The theoretical findings of [10] were applied to characterize the Gaussian fluctuations of edge
counting statistics associated with general random graphs, as well as to describe a
'Gaussian-to-Poisson' transition for random graphs with sparse connections. When applied to the so-called
disk graphs (see e.g. Penrose [23]), the results about Poisson limits yield Poissonized versions
of classic findings, e.g. by Jammalamadaka and Janson [7] and Silverman and Brown [31].
Note that Poisson approximations were established in [10] by the method of 'diagram formulae'
(see [21, Chapter 7]), and then further refined (in a much more general framework) in [18] by
combining the Malliavin calculus of variations and a classic version of the Chen-Stein method.

The principal aim of the present paper is to extend the results proved in [10] in order to
study the Gaussian fluctuations of $U$-statistics (of a general order) with rescaled kernels, based
on the points of a marked point process. As shown in Section 8, marked point processes emerge
naturally in a number of applications, as for instance those involving Boolean models. Another
contribution of the present work is an exhaustive characterization of the fluctuations of geometric
$U$-statistics, thus completing the analysis initiated in [25, Section 5]: this point will be dealt
with in Section 7, by applying the classic theory of Hoeffding decompositions for symmetric
$U$-statistics based on i.i.d. samples (see e.g. [33]).

Remark 1.2. Some of the central limit theorems deduced in the present paper as well as in [10]
could alternatively be obtained by combining the results of [21 [7] with a standard poissonization 
argument. Unlike our techniques, this approach would not yield explicit bounds on the speed of 
convergence in the Wasserstein distance. 

We stress that the main results of [10] and [25], that constitute the theoretical backbone of
our analysis, were obtained by means of the techniques developed in [19, 22], that were in turn
based on a combination of the Malliavin calculus of variations and of the so-called Stein's method
for normal approximations (see e.g. [3] for a general reference on this topic). Other remarkable
contributions to the line of research to which the present paper belongs are the following. In [3],
Decreusefond et al. present applications of the findings of [19] to the Gaussian fluctuation of
statistics based on random graphs on a torus; reference [14] contains several multidimensional
extensions of the theory initiated in [25]; the paper [18] contains some general bounds associated
with the Poisson approximation of the integer-valued functionals of a Poisson measure (further
applied in [30] to the study of order statistics); references [28] and [29] use some of the techniques
introduced in [19, 22] in order to deal, respectively, with Poisson-Voronoi approximations, and
with the asymptotic fluctuations of Poisson $k$-flat processes; further applications to $U$-statistics
are discussed in [11].

The remainder of the paper is organized as follows. Section 2 presents some background
material as well as a description of the main problems addressed in the paper. In Section
3 we discuss a preliminary example about subgraph counting, which is meant to familiarize
the reader with our approach. Section 4 contains fundamental estimates involving contraction
operators; Section 5 and Section 6 deal, respectively, with random variables living in a finite sum
of Wiener chaoses and with rescaled $U$-statistics with a stationary kernel; Section 7 contains
the announced characterization of the asymptotic behavior of geometric $U$-statistics, whereas
Section 8 is devoted to applications - in particular, to the Boolean model, and to random graphs.

2 Framework and goals 

This section contains a general description of the mathematical framework of the paper, as 
well as of the main problems that are addressed in the sections to follow. We start with a 
synthetic description of the results proved in [10], which constitute the theoretical backbone of
our analysis. One should also note that reference [10] uses in a fundamental way the Malliavin
calculus techniques developed in [19, 22].

2.1 Asymptotic normality on the Poisson chaos: a quick overview 

(i) (Framework) Throughout this section, we shall consider a measure space of the type
$(Z, \mathcal{Z}, \mu)$, where $Z$ is a Borel space, $\mathcal{Z}$ is the associated Borel $\sigma$-field, and $\mu$ is a $\sigma$-finite
Borel measure with no atoms. We shall denote by $\eta = \{\eta(B) : B \in \mathcal{Z},\ \mu(B) < \infty\}$ a
Poisson measure on $(Z, \mathcal{Z})$ with control measure $\mu$, which we assume to be defined on
some adequate probability space $(\Omega, \mathcal{F}, P)$. We shall assume that $\mathcal{F}$ is the $P$-completion
of the $\sigma$-field generated by $\eta$, so that $L^2(P) = L^2(\Omega, \mathcal{F}, P)$ coincides with the space of
square-integrable functionals of $\eta$. We write $\hat\eta = \eta - \mu$ for the associated compensated Poisson
measure. For every $k \ge 1$, the symbol $L^2(Z^k; \mu^k)$ (that we shall sometimes shorten to
$L^2(Z^k)$ or $L^2(\mu^k)$ if there is no ambiguity) stands for the space of measurable functions on
$Z^k$ that are square-integrable with respect to $\mu^k$. As usual, $L^2_s(Z^k; \mu^k) = L^2_s(Z^k) = L^2_s(\mu^k)$
is the subspace of $L^2(\mu^k)$ composed of functions that are $\mu^k$-almost everywhere symmetric.
We shall also adopt the special notation $L^{p,p'}(Z^k) = L^p(Z^k) \cap L^{p'}(Z^k)$, for $p, p' \ge 1$.

(ii) (Multiple integrals and chaos) For $f \in L^2_s(\mu^q)$, $q \ge 1$, we denote by $I_q(f)$ the multiple
Wiener-Itô integral, of order $q$, of $f$ with respect to $\hat\eta$, that is:

$$ I_q(f) = \int_{(Z^q)'} f(x_1, \ldots, x_q)\, \hat\eta(dx_1) \cdots \hat\eta(dx_q), \tag{2.1} $$

where the symbol $(Z^q)'$ indicates that all diagonal sets have been eliminated from the
domain of integration. The reader is referred for instance to [21, Chapter 5] for a complete
discussion of multiple Wiener-Itô integrals and their properties. These random variables
play a fundamental role in our analysis, since every square-integrable functional $F = F(\eta)$
can be decomposed in a (possibly infinite) sum of Wiener-Itô integrals. This feature,
known as "chaotic representation property", is the object of the next statement.

Proposition 2.1. Every random variable $F \in L^2(P)$ admits a (unique) chaotic
decomposition of the type

$$ F = E[F] + \sum_{i=1}^{\infty} I_i(f_i), \tag{2.2} $$

where the series converges in $L^2(P)$ and, for each $i \ge 1$, the kernel $f_i$ is an element of
$L^2_s(\mu^i)$.
(iii) (Wasserstein distance) Let $N$ be a centered Gaussian variable with variance $\sigma^2 > 0$.
In this paper, we will be interested in assessing the distance between the law of $N$ and
that of a random variable $F \in L^2(P)$ having the form

$$ F = E[F] + \sum_{i=1}^{k} I_{q_i}(f_i), \tag{2.3} $$

where $k \ge 1$, $1 \le q_1 < q_2 < \cdots < q_k$ are integers and, for every $1 \le i \le k$, $f_i$ is a non-zero
element of $L^2_s(\mu^{q_i})$. The Wasserstein distance between the law of $F$ and the law of $N$ is
defined as

$$ d_W(F, N) = \sup_{h \in \mathrm{Lip}_1} \big| E[h(F)] - E[h(N)] \big|, \tag{2.4} $$

where $\mathrm{Lip}_1$ stands for the class of Lipschitz functions with Lipschitz constant $\le 1$. In
the forthcoming Theorem 2.4, we shall collect results from [10], allowing one to assess
$d_W(F, N)$ by means of expressions involving the kernels of the Wiener-Itô expansion of $F$.

(iv) (Contractions) The main bounds evaluated in this paper are expressed in terms of the
contractions of the kernels $f_i$ (see e.g. [10, 19] for full details). Given two functions
$h \in L^2_s(Z^p; \mu^p)$ and $g \in L^2_s(Z^q; \mu^q)$ (for some $p, q \ge 1$), we shall use the following notation:
for every $0 \le l \le r \le \min(q, p)$, and whenever it is well-defined (see the discussion at
Point (v) below), the contraction of $g$ and $h$ is the function in $p + q - r - l$ variables given
by

$$ \begin{aligned} & h \star_r^l g(x_1, \ldots, x_{p-r}, x'_1, \ldots, x'_{q-r}, y_1, \ldots, y_{r-l}) \\ &\quad = \int_{Z^l} \mu^l(dz_1, \ldots, dz_l)\, h(x_1, \ldots, x_{p-r}, y_1, \ldots, y_{r-l}, z_1, \ldots, z_l) \\ &\qquad\qquad \times g(x'_1, \ldots, x'_{q-r}, y_1, \ldots, y_{r-l}, z_1, \ldots, z_l). \end{aligned} \tag{2.5} $$

In particular, if $p = q$ one has that $h \star_p^p g = \langle h, g \rangle_{L^2(Z^p)}$, the usual scalar product of $g$ and
$h$. For the sake of brevity, we shall often use multi-dimensional variables, represented by
a bold letter and indexed by their dimension: in this way, (2.5) can be rewritten as

$$ h \star_r^l g(\mathbf{x}_{p-r}, \mathbf{x}'_{q-r}, \mathbf{y}_{r-l}) = \int_{Z^l} h(\mathbf{x}_{p-r}, \mathbf{y}_{r-l}, \mathbf{z}_l)\, g(\mathbf{x}'_{q-r}, \mathbf{y}_{r-l}, \mathbf{z}_l)\, d\mu^l, $$

for $\mathbf{x}'_{q-r} \in Z^{q-r}$, $\mathbf{x}_{p-r} \in Z^{p-r}$, $\mathbf{y}_{r-l} \in Z^{r-l}$. Finally, we observe that, if $h \star_r^l g$ is
square-integrable, then its squared $L^2$ norm is given by the following iterated integral:

$$ \begin{aligned} & \| h \star_r^l g \|^2_{L^2(Z^{p+q-r-l};\, \mu^{p+q-r-l})} \\ &\quad = \int h(\mathbf{x}_{p-r}, \mathbf{y}_{r-l}, \mathbf{z}_l)\, h(\mathbf{x}_{p-r}, \mathbf{y}_{r-l}, \mathbf{z}'_l)\, g(\mathbf{x}'_{q-r}, \mathbf{y}_{r-l}, \mathbf{z}_l)\, g(\mathbf{x}'_{q-r}, \mathbf{y}_{r-l}, \mathbf{z}'_l)\, d\mu^{p+q-r+l}. \end{aligned} \tag{2.6} $$

(v) (Assumptions on kernels) The following assumption will be always satisfied by the 
random variables considered in this paper. 

Assumption 2.2. Every random variable of the type (2.3) considered in the sequel of
this paper is such that the following properties (1)-(3) are verified.

(1) For every $i = 1, \ldots, d$ and every $r = 1, \ldots, q_i$, the kernel $f_i \star_{q_i}^{q_i - r} f_i$ is an element of
$L^2(\mu^r)$.

(2) For every $i$ such that $q_i \ge 2$, every contraction of the type $(z_1, \ldots, z_{2q_i - r - l}) \mapsto |f_i| \star_r^l |f_i|(z_1, \ldots, z_{2q_i - r - l})$
is well-defined and finite for every $r = 1, \ldots, q_i$, every $l = 1, \ldots, r$
and every $(z_1, \ldots, z_{2q_i - r - l}) \in Z^{2q_i - r - l}$.

(3) For every $i, j = 1, \ldots, d$ such that $\max(q_i, q_j) > 1$, for every $k = |q_i - q_j| \vee 1, \ldots, q_i + q_j - 2$
and every $(r, l)$ verifying $k = q_i + q_j - 2 - r - l$,

$$ \int_Z \left[ \sqrt{ \int_{Z^k} \big( f_i(z, \cdot) \star_r^l f_j(z, \cdot) \big)^2\, d\mu^k } \right] \mu(dz) < \infty, $$

where, for every fixed $z \in Z$, the symbol $f_i(z, \cdot)$ denotes the mapping $(z_1, \ldots, z_{q-1}) \mapsto f_i(z, z_1, \ldots, z_{q-1})$.

Remark 2.3. According to [22, Lemma 2.9 and Remark 2.10], Point (1) in Assumption
2.2 implies that the following properties (a)-(c) are verified:

(a) for every $1 \le i < j \le k$, for every $r = 1, \ldots, q_i \wedge q_j$ and every $l = 1, \ldots, r$, the
contraction $f_i \star_r^l f_j$ is a well-defined element of $L^2(\mu^{q_i + q_j - r - l})$;

(b) for every $1 \le i \le j \le k$ and every $r = 1, \ldots, q_i$, $f_i \star_r^0 f_j$ is an element of $L^2(\mu^{q_i + q_j - r})$;

(c) for every $i = 1, \ldots, k$, for every $r = 1, \ldots, q_i$, and every $l = 1, \ldots, r \wedge (q_i - 1)$, the kernel
$f_i \star_r^l f_i$ is a well-defined element of $L^2(\mu^{2q_i - r - l})$.

In particular, every random variable $F$ verifying Assumption 2.2 is such that $I_{q_i}(f_i)^2 \in
L^2(P)$ for every $i = 1, \ldots, k$, yielding in turn that $E[F^4] < \infty$. Note that Assumption 2.2
is verified whenever the kernels $f_i$ are bounded functions with support in a rectangle of
the type $B \times \cdots \times B$, $\mu(B) < \infty$.

(vi) (The bound $B_3$) For $F$ as in (2.3), let $\sigma^2 = E[F^2]$. Following [10], we set

$$ B_3(F; \sigma) = \frac{1}{\sigma} \left\{ \max_{(*)} \| f_i \star_r^l f_j \|_{L^2(\mu^{q_i + q_j - r - l})} + \max_{i = 1, \ldots, k} \| f_i \|^2_{L^4(\mu^{q_i})} \right\}, \tag{2.7} $$

where $\max_{(*)}$ ranges over all quadruples $(i, j, r, l)$ such that $1 \le l \le r \le q_i \le q_j$ ($i, j \le k$)
and $l \ne q_j$ (in particular, quadruples such that $l = r = q_i = q_j = 1$ do not appear in the
argument of $\max_{(*)}$).
(vii) (Normal approximations) Now fix integers $k \ge 1$ and $1 \le q_1 < q_2 < \ldots < q_k$, and
consider a family $\{F_\lambda : \lambda > 0\}$ of random variables with the form

$$ F_\lambda = \sum_{i=1}^{k} I_{q_i}(f_{i,\lambda}), \qquad \lambda > 0, \tag{2.8} $$

with kernels $f_{i,\lambda}$ verifying Assumption 2.2 for each $\lambda$. We also use the following additional
notation: (i) $\sigma_\lambda^2 = E[(F_\lambda)^2]$, (ii) $F_{i,\lambda} = I_{q_i}(f_{i,\lambda})$, $i = 1, \ldots, d$, and (iii) $\sigma_{i,\lambda}^2 = E[(F_{i,\lambda})^2]$.
The next result collects some crucial findings from [10].



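Theorem 2.4. (See [10]) Let $\{F_\lambda\}$ be a collection of random variables as in (2.8),
where the integer $k$ does not depend on $\lambda$. Assume that there exists $\sigma^2 > 0$ such that
$\lim_{\lambda \to \infty} \sigma_\lambda^2 = \sigma^2$. Let $N \sim \mathcal{N}(0, \sigma^2)$.

1. For every $\lambda$, one has the estimate

$$ d_W(F_\lambda, N) \le C_0 \times B_3(F_\lambda; \sigma_\lambda) + \frac{\sqrt{2/\pi}}{\sigma_\lambda \vee \sigma}\, |\sigma_\lambda^2 - \sigma^2|, \tag{2.9} $$

where $C_0$ is some constant depending uniquely on $k$. In particular, if $B_3(F_\lambda; \sigma_\lambda) \to 0$,
as $\lambda \to \infty$, then $d_W(F_\lambda, N) \to 0$ and therefore $F_\lambda \overset{\mathrm{Law}}{\longrightarrow} N$.

2. Assume that $f_{i,\lambda} \ge 0$ for every $i, \lambda$, and also that the family $\{F_\lambda^4 : \lambda > 0\}$ is uniformly
integrable. Then, the following conditions (a)-(c) are equivalent, as $\lambda \to \infty$: (a)
$d_W(F_\lambda, N) \to 0$, (b) $B_3(F_\lambda; \sigma_\lambda) \to 0$, and (c) $E[F_\lambda^4] - 3\sigma_\lambda^4 \to 0$.

3. Let $\varphi : \mathbb{R}^k \to \mathbb{R}$ be a thrice differentiable function with bounded second and first
derivatives. Then, if $B_3(F_\lambda; \sigma_\lambda) \to 0$,

$$ E[\varphi(F_{1,\lambda}, \ldots, F_{k,\lambda})] - E[\varphi(N_{1,\lambda}, \ldots, N_{k,\lambda})] \to 0, \qquad \lambda \to \infty, $$

where $(N_{1,\lambda}, \ldots, N_{k,\lambda})$ is a centered Gaussian vector with the same covariance matrix
as $(F_{1,\lambda}, \ldots, F_{k,\lambda})$.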

Remark 2.5. In the statement of Theorem 2.4, we implicitly allow that the underlying
Poisson measure $\eta$ also changes with $\lambda$. In particular, one can assume that the associated
control measure $\mu = \mu_\lambda$ explicitly depends on $\lambda$.
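As an illustration of the contraction operators defined in (2.5), the following is a minimal sketch on a finite measure space, where integrals with respect to $\mu$ reduce to weighted sums. The function name `contraction`, the point set `points` and the weight dictionary `mu` are assumptions made for this example only; they do not come from [10] or from the present paper.

```python
from itertools import product

def contraction(h, p, g, q, r, l, points, mu):
    """Discrete analogue of h *_r^l g from (2.5) on a finite space.

    h and g are functions of p and q variables; `points` lists the atoms
    and `mu` maps each atom to its (finite) mass. r variables are
    identified, and l of them are integrated out.
    """
    assert 0 <= l <= r <= min(p, q)

    def contracted(x, x_prime, y):
        # x, x_prime, y play the roles of x_{p-r}, x'_{q-r}, y_{r-l}.
        assert len(x) == p - r and len(x_prime) == q - r and len(y) == r - l
        total = 0.0
        for z in product(points, repeat=l):
            weight = 1.0
            for atom in z:
                weight *= mu[atom]
            total += weight * h(*x, *y, *z) * g(*x_prime, *y, *z)
        return total

    return contracted

points = [0, 1, 2]
mu = {0: 0.5, 1: 1.0, 2: 2.0}
h = lambda a, b: a + b   # symmetric kernel of two variables
g = lambda a, b: a * b   # symmetric kernel of two variables

# Full contraction (p = q = r = l = 2): the scalar product <h, g>.
full = contraction(h, 2, g, 2, 2, 2, points, mu)((), (), ())
# r = 1, l = 0: one variable is identified, none is integrated.
partial = contraction(h, 2, g, 2, 1, 0, points, mu)((1,), (2,), (2,))
```

With $p = q = r = l$ the sketch returns the weighted scalar product of $h$ and $g$, in agreement with the remark following (2.5); with $l = 0$ no integration takes place and the output is a pointwise product.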



2.2 A general problem about marked point processes 

In this paper, we shall mainly deal with the normal approximation of functionals of marked 
Poisson point processes. Here is the general problem we are interested in. 

Problem 2.6. Let $X$ be a compact subset of $\mathbb{R}^d$ endowed with the Lebesgue measure $\ell$, and
let $M$ be a locally compact space, that we shall call the mark space, endowed with a probability
measure $\nu$. We shall assume that $X$ contains the origin in its interior, is symmetric, that is:
$X = -X$, and that the boundary of $X$ is negligible with respect to Lebesgue measure. We set
$Z = X \times M$, and we endow such a product space with the measure $\mu = \ell \otimes \nu$. Let
$\{\alpha_\lambda : \lambda > 0\}$ be a collection of positive numbers. Define $\mu_\lambda = \lambda\mu$, and let $\eta_\lambda$ be a Poisson
measure on $Z$ with control measure $\mu_\lambda$. For every $1 \le i \le k$ and every $\lambda > 0$, let $h_{i,\lambda}$ be a fixed
real function belonging to $L^2_s((\alpha_\lambda X)^{q_i} \times M^{q_i})$. Let $F_\lambda$ be defined as in (2.8), where the multiple
integrals are with respect to $\hat\eta_\lambda = \eta_\lambda - \lambda\mu$, and assume that each kernel $f_{i,\lambda}$ is of the form

$$ f_{i,\lambda}(\mathbf{x}_{q_i}) = \gamma_{i,\lambda}\, h_{i,\lambda}(\alpha_\lambda \mathbf{x}_{q_i}), \qquad \mathbf{x}_{q_i} \in Z^{q_i}, $$

for some $\gamma_{i,\lambda} > 0$. Which conditions on the $\gamma_{i,\lambda}$, $\alpha_\lambda$ and $h_{i,\lambda}$, $i = 1, \ldots, k$, yield the asymptotic
normality of

$$ \tilde F_\lambda = \frac{F_\lambda - E[F_\lambda]}{\sqrt{\mathrm{Var}(F_\lambda)}}, \qquad \text{as } \lambda \to \infty? $$

Sufficient conditions for asymptotic normality, together with explicit estimates for the
quantity $B_3$ appearing in Theorem 2.4, are derived in Theorem 5.1.



Remark 2.7. (a) In the above formulation of Problem 2.6, introducing the scaling factor $\alpha_\lambda$
might seem redundant (since $h_{i,\lambda}$ also depends on $\lambda$). However, this representation is
convenient for the applications developed below, where, for each $i = 1, \ldots, k$, the kernel
$h_{i,\lambda}$ will be assumed to converge to some global function $h_i$ that does not depend on $\lambda$.

(b) It is proved in [27, Theorem 3.5.8] that any stationary marked Poisson point process $\eta$ has
intensity measure of the form $\mu_\lambda = \lambda \ell \otimes \nu$, where $\lambda > 0$, $\ell$ is the $d$-dimensional Lebesgue
measure, and $\nu$ is a probability measure on $M$. In the parlance of stochastic geometry, a
stationary marked point process $\eta = \{(t_i, m_i)\}$ is therefore always obtained from a ground
process $\eta_0 = \{t_i\}$, that is, from a Poisson point process with intensity $\lambda\ell$, by attaching to
each point $t_i$ an independent random mark $m_i$, drawn from $M$ according to the probability
distribution $\nu$.
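The marking construction recalled in Remark 2.7-(b) can be sketched as follows, for the restriction of the process to a cube $[-a, a]^d$. This is only an illustrative simulation under stated assumptions: the helper names `sample_poisson` and `sample_marked_process`, the cube-shaped window and the user-supplied mark sampler `draw_mark` are hypothetical and are not taken from the references.

```python
import random

def sample_poisson(mean, rng):
    """Poisson(mean) variate: number of unit-rate exponential arrivals in [0, mean]."""
    count = 0
    arrival = rng.expovariate(1.0)
    while arrival <= mean:
        count += 1
        arrival += rng.expovariate(1.0)
    return count

def sample_marked_process(lam, d, half_width, draw_mark, rng):
    """Marked Poisson process with intensity lam * (Lebesgue x nu) on [-a, a]^d x M.

    The ground process is a homogeneous Poisson process with intensity lam;
    each point receives an independent mark produced by draw_mark(rng).
    """
    volume = (2.0 * half_width) ** d
    n_points = sample_poisson(lam * volume, rng)
    return [
        (tuple(rng.uniform(-half_width, half_width) for _ in range(d)), draw_mark(rng))
        for _ in range(n_points)
    ]

rng = random.Random(0)
# Marks are fair coin flips in {0, 1}, an arbitrary choice for the example.
sample = sample_marked_process(5.0, 2, 1.0, lambda r: r.choice([0, 1]), rng)
```

The number of points is Poisson with mean $\lambda$ times the volume of the window, and the marks do not influence the locations, which is exactly the independence structure described in the remark.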



2.3 Rescaled marked U-statistics

Following [25, Section 3.1], we now introduce the concept of a $U$-statistic associated with the
Poisson measure $\eta$. This is the most natural example of an element of $L^2(P)$ having a finite
Wiener-Itô expansion.

Definition 2.8. ($U$-statistics) Fix $k \ge 1$. A random variable $F$ is called a $U$-statistic of order
$k$, based on a Poisson measure $\eta$ with control $\mu$, if there exists a kernel $h \in L^1_s(\mu^k)$ such that

$$ F = \sum_{\mathbf{x} \in \eta^k_{\neq}} h(\mathbf{x}), \tag{2.10} $$

where the symbol $\eta^k_{\neq}$ indicates the class of all $k$-dimensional vectors $\mathbf{x} = (x_1, \ldots, x_k)$ such that
$x_i \in \eta$ and $x_i \ne x_j$ for every $1 \le i \ne j \le k$. As made clear in [25, Definition 3.1], the possibly
infinite sum appearing in (2.10) must be regarded as the $L^1(P)$ limit of objects of the type
$\sum_{\mathbf{x} \in \eta^k_{\neq} \cap A_n} h(\mathbf{x})$, $n \ge 1$, where the sets $A_n \subset Z^k$ are such that $\mu^k(A_n) < \infty$ and $A_n \uparrow Z^k$, as
$n \to \infty$.
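For a finite point configuration, the sum in (2.10) can be evaluated directly. The sketch below is a hedged illustration (the function name `u_statistic` and the edge-counting kernel are choices made for this example): it sums a kernel over all ordered $k$-tuples of pairwise distinct points, which is how the index set $\eta^k_{\neq}$ is read here.

```python
from itertools import permutations

def u_statistic(points, h, k):
    """Sum of h over all ordered k-tuples of pairwise distinct points.

    Distinctness is by position in `points`; for a Poisson measure with
    non-atomic control the points are almost surely distinct anyway.
    """
    return sum(h(*tup) for tup in permutations(points, k))

def edge_kernel(t):
    """Symmetric kernel (1/2) * 1{0 < |x - y| < t}, so that each edge is counted once."""
    return lambda x, y: 0.5 if 0 < abs(x - y) < t else 0.0

points = [0.0, 0.1, 0.25, 1.0]
n_edges = u_statistic(points, edge_kernel(0.2), 2)
```

With the factor $1/2$ in the kernel, the two orderings of each pair add up to one, so the value of the $U$-statistic is the number of pairs of points at distance less than $t$.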

The following crucial fact is proved by Reitzner and Schulte in [25, Lemma 3.5 and Theorem
3.6]:

Proposition 2.9. Consider a kernel $h \in L^1_s(\mu^k)$ such that the corresponding $U$-statistic $F$ in
(2.10) is square-integrable. Then, $h$ is necessarily square-integrable, and $F$ admits a
representation of the form (2.2), with

$$ f_i(\mathbf{x}_i) := h_i(\mathbf{x}_i) = \binom{k}{i} \int_{Z^{k-i}} h(\mathbf{x}_i, \mathbf{x}_{k-i})\, d\mu^{k-i}, \qquad \mathbf{x}_i \in Z^i, \tag{2.11} $$

for $1 \le i \le k$, and $f_i = 0$ for $i > k$. In particular, $h = f_k$ and the projection $f_i$ is in $L^{1,2}_s(\mu^i)$
for each $1 \le i \le k$.

Remark 2.10. Somewhat counterintuitively, it is proved in [25] that the condition $h \in L^{1,2}_s(Z^k)$
does not ensure that the associated $U$-statistic $F$ in (2.10) is a square-integrable random variable.

As discussed in the Introduction, $U$-statistics based on Poisson measures play a fundamental
(and more or less explicit) role in many geometric problems - like for instance those related with
random graphs. The aim of this paper is to estimate as precisely as possible the contraction
norms involved in Theorem 2.4 when $F$ has the form of a $U$-statistic whose kernel $h$ verifies some
specific geometric assumptions. As anticipated, this significantly extends the analysis initiated
in the second part of [10] - where we only focused on $U$-statistics of order $k = 2$ with kernels
equal to indicator functions. In the sequel, we will mostly assume that, as the intensity of
the Poisson measure changes with $\lambda > 0$, the underlying kernel $h_\lambda$ is the restriction of a fixed
kernel $h$ whose argument is deformed by a factor $\alpha_\lambda > 0$. The following problem will guide our
discussion throughout Sections 4, 5 and 6: it can be regarded as a special case of Problem 2.6.



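Problem 2.11. Let $Z$, $\mu_\lambda$, $\eta_\lambda$ and $\alpha_\lambda$, $\lambda > 0$, be as in the statement of Problem 2.6; in particular,
$\eta_\lambda$ is a Poisson measure on the space $Z = X \times M$ (note that $Z$ is independent of $\lambda$). Fix $k \ge 1$
and let $h$ be a fixed real-valued function on $(\mathbb{R}^d)^k \times M^k$ whose restriction to $(\alpha_\lambda X)^k \times M^k$
belongs to $L^1_s((\alpha_\lambda X)^k \times M^k)$ for every $\lambda > 0$. We are then interested in characterizing those
collections of $U$-statistics of the type

$$ F_\lambda := F(h, X, M; \alpha_\lambda, \eta_\lambda) = \sum_{\mathbf{x} \in \eta^k_{\lambda, \neq}} h(\alpha_\lambda \mathbf{x}), \qquad \lambda > 0, \tag{2.12} $$

such that each $F_\lambda$ is square-integrable and moreover

$$ \tilde F_\lambda := \frac{F(h, X, M; \alpha_\lambda, \eta_\lambda) - E[F(h, X, M; \alpha_\lambda, \eta_\lambda)]}{\sqrt{\mathrm{Var}(F(h, X, M; \alpha_\lambda, \eta_\lambda))}} \overset{\mathrm{Law}}{\longrightarrow} N \sim \mathcal{N}(0, 1), $$

as $\lambda \to \infty$.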

Sufficient conditions for the asymptotic normality of this type of $U$-statistics, together with
explicit estimates for the rates of convergence, will be derived in Theorem 6.2.



Examples 2.12. (i) Assume that either $\alpha_\lambda = 1$ or $h$ is homogeneous, meaning

$$ h(\alpha x) = \alpha^\beta h(x), \qquad \forall \alpha > 0,\ \forall x \in \mathbb{R}^d, \tag{2.13} $$

for some $\beta > 0$. Then the random variable $F_\lambda$ in (2.12) is a geometric $U$-statistic, where
we use the terminology introduced in [25]. Examples and sufficient conditions for
normality are given in [25]. Using Hoeffding decompositions, in Section 7 we shall provide an
exhaustive characterization of their (central and non-central) asymptotic behavior.

(ii) If $\alpha_\lambda = \lambda^{1/d}$, then the random variable $F_\lambda$ in (2.12) verifies the identity in law

$$ F_\lambda \overset{\mathrm{Law}}{=} \sum_{\mathbf{x}_k \in (\eta_1 \cap (\lambda^{1/d} X \times M))^k_{\neq}} h(\mathbf{x}_k), $$

where $\eta_1$ is a homogeneous marked point process on $\mathbb{R}^d \times M$ with control measure $\ell \otimes \nu$.
3 Preliminary example: the importance of analytic
bounds in subgraph counting



Before tackling Problem 2.6 and Problem 2.11 in their full generality, we shall discuss a simple
application of Theorem 2.4-1 to CLTs associated with subgraph counting statistics in a standard
disk graph model. The aim of this section is to familiarize the reader - in a more elementary
setting - with some of the computations developed in the remainder of the paper. In
particular, the forthcoming Theorem 3.3 involves $U$-statistics that are based on stationary kernels, a
notion that will be formally introduced in Section 4.2 in the general framework of marked point
processes. One important message that we try to deliver is that the bound (2.9) allows one to
deal at once with all possible asymptotic regimes of the disk graph model - as defined at the
end of the forthcoming Section 3.1. The idea that analytic bounds such as (2.9) may be used to
simultaneously encompass a wide array of probabilistic structures is indeed one of the staples of
the present paper and [10].

Remark 3.1. The CLT presented in Section 3.2 is both a special case and a strengthening of the
findings contained in [23, Section 3.4]. Indeed, in such a reference the author deduces
multidimensional versions of the CLT below, but without providing explicit bounds in the Wasserstein
distance. Also, when $k = 2$ (that is, edge counting) the results of this section in the case of a
uniform density $f$ (comprising the upper bounds) are a special case of [10, Theorem 4.9]. The
notation used below has been chosen in order to be loosely consistent with the one adopted in
reference [23].

3.1 Framework 

We fix $d \ge 1$, as well as a bounded and almost everywhere continuous probability density $f$ on
$\mathbb{R}^d$. We denote by $Y = \{Y_i : i \ge 1\}$ a sequence of $\mathbb{R}^d$-valued i.i.d. random variables, distributed
according to the density $f$. For every $n = 1, 2, \ldots$, we write $N(n)$ to indicate a Poisson random
variable with mean $n$, independent of $Y$. It is a standard result that the random measure
$\eta_n = \sum_{i=1}^{N(n)} \delta_{Y_i}$, where $\delta_x$ indicates a Dirac mass at $x$, is a Poisson measure on $\mathbb{R}^d$ with control
$\mu_n(dx) = n f(x)\, dx$ (where $dx$ stands for the Lebesgue measure). We shall also write $\hat\eta_n = \eta_n - \mu_n$,
$n \ge 1$.

Let $\{t_n : n \ge 1\}$ be a sequence of strictly decreasing positive numbers such that $\lim_{n \to \infty} t_n = 0$.
For every $n$, we define $G'(Y; t_n)$ to be the disk graph obtained as follows: the vertices of
$G'(Y; t_n)$ are given by the random set $V_n = \{Y_1, \ldots, Y_{N(n)}\}$ and two vertices $Y_i, Y_j$ are connected
by an edge if and only if $\|Y_i - Y_j\|_{\mathbb{R}^d} \in (0, t_n)$. By convention, we set $G'(Y; t_n) = \emptyset$ whenever
$N(n) = 0$. Now fix $k \ge 2$, and let $\Gamma$ be a connected graph of order $k$. For every $n \ge 1$, we
shall denote by $G'_n(\Gamma)$ the number of induced subgraphs of $G'(Y; t_n)$ that are isomorphic to $\Gamma$,
that is: $G'_n(\Gamma)$ counts the number of subsets $\{i_1, \ldots, i_k\} \subset \{1, \ldots, N(n)\}$ such that the restriction
of $G'(Y; t_n)$ to $\{Y_{i_1}, \ldots, Y_{i_k}\}$ is isomorphic to $\Gamma$. We shall assume throughout the following that
$\Gamma$ is feasible for every $n$. This requirement means that the probability that the restriction of
$G'(Y; t_n)$ to $\{Y_1, \ldots, Y_k\}$ is isomorphic to $\Gamma$ is strictly positive for every $n$.
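The definition of the counting statistic $G'_n(\Gamma)$ can be made concrete by a brute-force computation on a fixed configuration of points. The sketch below is meant only for small examples; the function names `disk_graph_edges` and `count_induced_copies`, and the encoding of $\Gamma$ as a list of edges on the vertex labels $0, \ldots, k-1$, are assumptions of this illustration.

```python
from itertools import combinations, permutations
import math

def disk_graph_edges(points, t):
    """Edges (i, j), i < j, of the disk graph: 0 < |p_i - p_j| < t."""
    return {
        (i, j)
        for i, j in combinations(range(len(points)), 2)
        if 0 < math.dist(points[i], points[j]) < t
    }

def count_induced_copies(points, t, gamma_edges, k):
    """Number of k-subsets of points whose induced disk graph is isomorphic to Gamma.

    gamma_edges lists the edges of Gamma on the vertex labels 0, ..., k-1.
    Isomorphism is tested by trying every bijection, so k must be small.
    """
    edges = disk_graph_edges(points, t)
    pattern = {frozenset(e) for e in gamma_edges}
    count = 0
    for subset in combinations(range(len(points)), k):
        induced = {frozenset(p) for p in combinations(subset, 2) if p in edges}
        if len(induced) != len(pattern):
            continue
        if any(
            {frozenset(perm[v] for v in e) for e in pattern} == induced
            for perm in permutations(subset)
        ):
            count += 1
    return count

points = [(0.0,), (0.1,), (0.2,), (5.0,)]
path = [(0, 1), (1, 2)]              # path on three vertices
triangle = [(0, 1), (1, 2), (0, 2)]  # complete graph on three vertices
n_paths = count_induced_copies(points, 0.15, path, 3)
n_triangles = count_induced_copies(points, 0.15, triangle, 3)
```

In this configuration the first three points form an induced path (the two extreme points are at distance $0.2 > 0.15$), and no triangle appears. In the Poissonized model of this section, `points` would be replaced by a realization of $\{Y_1, \ldots, Y_{N(n)}\}$ and `t` by $t_n$.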

We are interested in studying the Gaussian fluctuations, as $n \to \infty$, of the random variable
$G'_n(\Gamma)$. In order to do this, one usually distinguishes the following three regimes.

(R1) $n t_n^d \to 0$ and $n^k (t_n^d)^{k-1} \to \infty$;
(R2) $n t_n^d \to \infty$;
(R3) (Thermodynamic regime) $n t_n^d \to c$, for some constant $c \in (0, \infty)$.

The following statement collects some important estimates from [23, Chapter 3]. Given
positive sequences $a_n, b_n$, we shall use the standard notation $a_n \sim b_n$ to indicate that, as $n \to \infty$,
$a_n / b_n \to 1$.

Proposition 3.2. Under the three regimes (R1), (R2) and (R3), one has that $E[G'_n(\Gamma)] \asymp
n^k (t_n^d)^{k-1}$. Moreover there exist strictly positive constants $c_1, c_2, c_3$ such that, as $n \to \infty$,

- under (R1), $\mathrm{Var}(G'_n(\Gamma)) \sim c_1 \times n^k (t_n^d)^{k-1}$;

- under (R2), $\mathrm{Var}(G'_n(\Gamma)) \sim c_2 \times n^{2k-1} (t_n^d)^{2k-2}$;

- under (R3), $\mathrm{Var}(G'_n(\Gamma)) \sim c_3 \times n$.

The next subsection contains the announced normal approximation result. 

3.2 The CLT 

For every $n \ge 1$, we set

$$ \tilde G'_n(\Gamma) = \frac{G'_n(\Gamma) - E[G'_n(\Gamma)]}{\mathrm{Var}(G'_n(\Gamma))^{1/2}}, $$

and we consider a random variable $N \sim \mathcal{N}(0, 1)$. The following statement is the main
achievement of the present section.

Theorem 3.3. Let the assumptions and notation of this section prevail. There exists a constant
$C > 0$, independent of $n$, such that, for every $n \ge 1$,

- under (R1), $d_W(\tilde G'_n(\Gamma), N) \le C \times (n^k (t_n^d)^{k-1})^{-1/2}$;

- under (R2)-(R3), $d_W(\tilde G'_n(\Gamma), N) \le C \times n^{-1/2}$.

In particular, under the three regimes (R1), (R2) and (R3), one has that $\tilde G'_n(\Gamma)$ converges in
distribution to $N$, as $n \to \infty$.
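To get a feeling for how the three regimes and the rates of Theorem 3.3 interact, one can specialize to radii of the form $t_n = n^{-a}$ with $a > 0$. This power-law choice, as well as the function names `regime_for_power_law` and `wasserstein_rate`, are assumptions of the following sketch; the constant $C$ of the theorem is not tracked, so only the order of magnitude is returned.

```python
def regime_for_power_law(a, d, k):
    """Regime of the sequence t_n = n**(-a), a > 0, for subgraphs of order k.

    n * t_n^d = n**(1 - a*d), and n^k * (t_n^d)^(k-1) = n**(k - a*d*(k-1)).
    """
    if a * d < 1:
        return "R2"   # n * t_n^d diverges
    if a * d == 1:
        return "R3"   # n * t_n^d is constant (thermodynamic regime)
    if a * d < k / (k - 1):
        return "R1"   # n * t_n^d -> 0 while n^k (t_n^d)^(k-1) diverges
    return None       # outside (R1)-(R3): no bound from Theorem 3.3

def wasserstein_rate(n, a, d, k):
    """Order of the bound in Theorem 3.3 for t_n = n**(-a), up to the constant C."""
    regime = regime_for_power_law(a, d, k)
    if regime is None:
        raise ValueError("t_n = n**(-a) falls outside the regimes (R1)-(R3)")
    if regime == "R1":
        t_d = n ** (-a * d)
        return (n ** k * t_d ** (k - 1)) ** -0.5
    return n ** -0.5

# Triangles or paths on three vertices (k = 3) in the plane (d = 2).
sparse_rate = wasserstein_rate(10 ** 6, 0.6, 2, 3)
dense_rate = wasserstein_rate(10 ** 6, 0.25, 2, 3)
```

For $d = 2$, $k = 3$ and $a = 0.6$ the sequence falls in (R1) and the rate is of order $n^{-0.3}$, which is slower than the rate $n^{-1/2}$ obtained under (R2)-(R3).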

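Proof: By construction, the random variable $G'_n(\Gamma)$ has the form

$$ G'_n(\Gamma) = \sum_{(x_1, \ldots, x_k) \in \eta^k_{n, \neq}} h_{\Gamma, t_n}(x_1, \ldots, x_k), $$

where the function $h_{\Gamma, t_n} : (\mathbb{R}^d)^k \to \mathbb{R}$ equals $1/k!$ if the restriction of $G'(Y; t_n)$ to $\{x_1, \ldots, x_k\}$ is
isomorphic to $\Gamma$, and equals $0$ otherwise. Plainly, $h_{\Gamma, t_n}$ has the following two characteristics: (i)
$h_{\Gamma, t_n}$ is symmetric, and (ii) $h_{\Gamma, t_n}$ is stationary, in the sense that it only depends on the norms
$\|x_i - x_j\|_{\mathbb{R}^d}$, $i \ne j$. Using [25, Lemma 3.5], we deduce that the random variable $G'_n(\Gamma)$ (that has
trivially moments of all orders) admits the following chaotic decomposition

$$ G'_n(\Gamma) = E[G'_n(\Gamma)] + \sum_{i=1}^{k} I_i(h_i), $$

where $E[G'_n(\Gamma)] = \int_{(\mathbb{R}^d)^k} h_{\Gamma, t_n}\, d\mu_n^k$,

$$ h_i(x_1, \ldots, x_i) = \binom{k}{i} \int_{(\mathbb{R}^d)^{k-i}} h_{\Gamma, t_n}(x_1, \ldots, x_i, y_1, \ldots, y_{k-i})\, \mu_n^{k-i}(dy_1, \ldots, dy_{k-i}) := \binom{k}{i} h^{(i)}_{\Gamma, t_n}(x_1, \ldots, x_i), $$

and $I_i$ indicates a (multiple) Wiener-Itô integral of order $i$, with respect to the compensated
measure $\hat\eta_n$. Writing $v_n^2 := \mathrm{Var}(G'_n(\Gamma))$, by virtue of (2.9) one has that the result is proved once
it is shown that, for every $j = 1, \ldots, k$ and for every $1 \le l \le r \le i \le j \le k$ such that $l \ne j$, the
following holds as $n \to \infty$: under (R1),

$$ \frac{\| h^{(j)}_{\Gamma, t_n} \|^4_{L^4(\mu_n^j)}}{v_n^4} = O\!\left( \frac{1}{n^k (t_n^d)^{k-1}} \right), \qquad \frac{\| h^{(i)}_{\Gamma, t_n} \star_r^l h^{(j)}_{\Gamma, t_n} \|^2_{L^2(\mu_n^{i+j-r-l})}}{v_n^4} = O\!\left( \frac{1}{n^k (t_n^d)^{k-1}} \right), \tag{3.14} $$

and, under (R2)-(R3),

$$ \frac{\| h^{(j)}_{\Gamma, t_n} \|^4_{L^4(\mu_n^j)}}{v_n^4} = O(n^{-1}), \qquad \frac{\| h^{(i)}_{\Gamma, t_n} \star_r^l h^{(j)}_{\Gamma, t_n} \|^2_{L^2(\mu_n^{i+j-r-l})}}{v_n^4} = O(n^{-1}). \tag{3.15} $$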

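Observe that $\| h^{(j)}_{\Gamma, t_n} \|^4_{L^4(\mu_n^j)} = \| h^{(j)}_{\Gamma, t_n} \star_j^0 h^{(j)}_{\Gamma, t_n} \|^2_{L^2(\mu_n^j)}$, so that we just have to check the second
relation in (3.14) and in (3.15) for every $(i, j, r, l)$ in the set $Q = \{1 \le l \le r \le i \le j \le k,\ l \ne
j\} \cup \{i = r = j,\ l = 0\}$. For every $(i, j, r, l) \in Q$ we define the function $h^{(i,j,r,l)}_{\Gamma, t_n} : (\mathbb{R}^d)^\alpha \to \mathbb{R}$,
where $\alpha = \alpha(i, j, r, l) = 4k - i - j - r + l$, as follows:

$$ \begin{aligned} h^{(i,j,r,l)}_{\Gamma, t_n}(x_1, \ldots, x_\alpha) = {} & h_{\Gamma, t_n}(\mathbf{x}^{(1)}_{k-i}, \mathbf{x}^{(2)}_{i-r}, \mathbf{x}^{(3)}_{r-l}, \mathbf{x}^{(4)}_{l})\, h_{\Gamma, t_n}(\mathbf{x}^{(5)}_{k-j}, \mathbf{x}^{(6)}_{j-r}, \mathbf{x}^{(3)}_{r-l}, \mathbf{x}^{(4)}_{l}) & (3.16) \\ & \times h_{\Gamma, t_n}(\mathbf{x}^{(7)}_{k-i}, \mathbf{x}^{(2)}_{i-r}, \mathbf{x}^{(3)}_{r-l}, \mathbf{x}^{(8)}_{l})\, h_{\Gamma, t_n}(\mathbf{x}^{(9)}_{k-j}, \mathbf{x}^{(6)}_{j-r}, \mathbf{x}^{(3)}_{r-l}, \mathbf{x}^{(8)}_{l}), & (3.17) \end{aligned} $$

where the numbered bold letters stand for packets of variables providing a lexicographic
decomposition of $(x_1, \ldots, x_\alpha)$, for instance $\mathbf{x}^{(1)}_{k-i} = (x_1, \ldots, x_{k-i})$, $\mathbf{x}^{(2)}_{i-r} = (x_{k-i+1}, \ldots, x_{k-r})$, and so
on. In this way, one has that $(\mathbf{x}^{(1)}_{k-i}, \mathbf{x}^{(2)}_{i-r}, \mathbf{x}^{(3)}_{r-l}, \mathbf{x}^{(4)}_{l}, \mathbf{x}^{(5)}_{k-j}, \mathbf{x}^{(6)}_{j-r}, \mathbf{x}^{(7)}_{k-i}, \mathbf{x}^{(8)}_{l}, \mathbf{x}^{(9)}_{k-j}) = (x_1, \ldots, x_\alpha)$,
and we set $\mathbf{x}^{(\cdot)}_p$ equal to the empty set whenever $p = 0$. Observe that each function $h^{(i,j,r,l)}_{\Gamma, t_n}$
takes values in the set $\{0, (k!)^{-4}\}$; moreover, writing $h^{(i,j,r,l)}_{\Gamma, 1}$ for the function obtained by
replacing $t_n$ with $1$ in (3.16)-(3.17), the connectedness of the graph $\Gamma$ implies that the
mapping $(x_2, \ldots, x_\alpha) \mapsto h^{(i,j,r,l)}_{\Gamma, 1}(0, x_2, \ldots, x_\alpha)$, where $0$ stands for the origin, has compact support.
Applying (2.6), one has that, for every $(i, j, r, l) \in Q$,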



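$$ \| h^{(i)}_{\Gamma, t_n} \star_r^l h^{(j)}_{\Gamma, t_n} \|^2_{L^2(\mu_n^{i+j-r-l})} = n^\alpha \int_{(\mathbb{R}^d)^\alpha} h^{(i,j,r,l)}_{\Gamma, t_n}(x_1, \ldots, x_\alpha)\, f(x_1) \cdots f(x_\alpha)\, dx_1 \cdots dx_\alpha, $$

where $\alpha = 4k - i - j - r + l$, as before. Applying the change of variables $x_1 = x$ and $x_m = t_n y_m + x$,
for $m = 2, \ldots, \alpha$, and using the stationarity of $h_{\Gamma, t_n}$, one sees that

$$ \| h^{(i)}_{\Gamma, t_n} \star_r^l h^{(j)}_{\Gamma, t_n} \|^2_{L^2(\mu_n^{i+j-r-l})} = n^\alpha (t_n^d)^{\alpha - 1} \int_{\mathbb{R}^d} f(x) \int_{(\mathbb{R}^d)^{\alpha - 1}} h^{(i,j,r,l)}_{\Gamma, 1}(0, y_2, \ldots, y_\alpha)\, f(x + t_n y_2) \cdots f(x + t_n y_\alpha)\, dx\, dy_2 \cdots dy_\alpha. $$

Since, by dominated convergence, the integral on the RHS in the previous equation converges
to the constant

$$ \int_{\mathbb{R}^d} f^\alpha(x)\, dx \int_{(\mathbb{R}^d)^{\alpha - 1}} h^{(i,j,r,l)}_{\Gamma, 1}(0, y_2, \ldots, y_\alpha)\, dy_2 \cdots dy_\alpha, $$

we deduce that $\| h^{(i)}_{\Gamma, t_n} \star_r^l h^{(j)}_{\Gamma, t_n} \|^2_{L^2(\mu_n^{i+j-r-l})} = O(n^\alpha (t_n^d)^{\alpha - 1})$ for every $(i, j, r, l) \in Q$. Using this
estimate together with Proposition 3.2 and after some standard computations, one sees that
relations (3.14)-(3.15) are in order, and the desired conclusion is therefore achieved. □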
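The scaling produced by the change of variables in the proof can be checked numerically in the simplest situation: $d = 1$, $f$ the uniform density on $[0, 1]$ and the kernel $\mathbf{1}\{|x - y| < t\}$ (edge counting, up to the factor $1/2$). This toy case, with two variables instead of $\alpha$, and the function name `edge_integral` are assumptions of the sketch. Here the integral equals $2t - t^2$ exactly, whereas the limiting constant is $\int f^2 \times \int \mathbf{1}\{|y| < 1\}\, dy = 2$.

```python
def edge_integral(t, grid=10_000):
    """Midpoint rule for the double integral of 1{|x - y| < t} over [0, 1]^2.

    For fixed x, the inner integral in y is the length of [x - t, x + t]
    intersected with [0, 1]; only the outer integral is discretized.
    """
    total = 0.0
    for i in range(grid):
        x = (i + 0.5) / grid
        total += min(1.0, x + t) - max(0.0, x - t)
    return total / grid

# After dividing by t^(alpha - 1) = t, the integral approaches the constant 2.
ratios = [edge_integral(t) / (2.0 * t) for t in (0.1, 0.01, 0.001)]
```

The ratio equals $1 - t/2$ and tends to $1$ as $t \to 0$, which is the elementary counterpart of the estimate $O(n^\alpha (t_n^d)^{\alpha - 1})$ obtained above by dominated convergence.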
Remark 3.4. 1. The CLT under the regime (R1) could in principle be deduced from
Theorem 3.4 in [23], via a Poissonization argument. However, since this strategy is based on
an intermediate Poisson approximation, in this way one would obtain suboptimal rates of
convergence (in the Kolmogorov distance).

2. It is interesting to note that our proof of Theorem 3.3 is based on exactly the same change
of variables that one usually applies in variance and covariance estimates - see e.g. [23,
Section 3.3].

As anticipated, in what follows we will show that the kind of arguments displayed in the 
previous proof can be extended to the framework of functionals of marked point processes. 

4 Technical estimates on rescaled contractions 
4.1 Framework and general estimates 

In this section, we collect several analytic estimates on the norms of contractions of multivariate
functions satisfying some specific geometric assumptions. They will be used in further sections
to study the asymptotic behavior of $U$-statistics.

Remark 4.1. (Some conventions) 

(a) Unless otherwise specified, throughout this section $Z = X \times M$, $\mu_\lambda$ and $\alpha_\lambda$, $\lambda > 0$, are
defined as in the statement of Problem 2.6. In particular, $X$ is a symmetric compact
subset of $\mathbb{R}^d$, for some fixed integer $d \ge 1$, with $0$ in its interior and negligible boundary.
We shall also use the shorthand notation Z\ = ot\Z = (ot\X) x M, and Zj^ = (R d x M) k . 

(b) A point $x$ in $Z = X \times M$ is represented as $x = (t, m)$, where $t \in X$ is the spatial variable
and $m \in M$ is the mark. A $k$-tuple $(x_1, \ldots, x_k) \in Z^k$ is denoted by the bold letter
$\mathbf{x}_k$; a $k$-tuple $(t_1, \ldots, t_k) \in (\mathbb{R}^d)^k$ is denoted by the bold letter $\mathbf{t}_k$. When considering
symmetric functions $f(x_1, \ldots, x_k)$ on $Z^k$, the chosen order of the $x_i$ in the argument of
$f$ is immaterial. We will sometimes separate the set of variables $\mathbf{x}_k = (\mathbf{x}_j, \mathbf{x}_{k-j})$, where
$\mathbf{x}_j$ is the $j$-tuple formed by the $j$ first variables and $\mathbf{x}_{k-j}$ the $k - j$ last variables. By
symmetry, one has of course that $f(\mathbf{x}_k) = f(\mathbf{x}_j, \mathbf{x}_{k-j}) = f(\mathbf{x}_{k-j}, \mathbf{x}_j)$.
(c) In our framework, any geometric operation 6 applied to a point x = (t, m) is by definition 
only applied to the spatial coordinate. For instance, the s-translation is given by 6 s x = 
x + s = (s + t,m), whereas the a-dilatation (a ^ 0) is 5 a x = ax = (at,m). These 
conventions are canonically extended to A:-tuples of points, as well as to subsets of Z^. 
Observe that, if t G M d and t k = (t±, ...,tk) G X k , then t fc + t = (ti + t,t 2 + t, ...,t k + t). 

(d) Let A C R d . For every k ^ 2 there exists a canonical bijection ipk between the two sets 
(A x M) k and A k x M k , given by 

ip k : ((tx,mi),...,(t k ,m k )) (-)• (ti,...,t fe ,mi,...,m fe ), U G A, mi G M. 

To simplify our discussion, in what follows we will systematically identify the two sets 
(A x M) k and A k x M k by implicitly applying the mapping ifj k and its inverse. For instance: 
(t k ,m k ) G (A x M) k , where t k = (ti,...,t k ) G (R d ) k and m k = (mi, ...,m fe ) G M k is 
shorthand for ^ 1 {t kl m. k ) € (i x M) k ; writing x^ = (t k ,m k ), where x^ = (x\, ...,x k ), 
Xi £R d x M, means indeed that (t k ,m k ) = ifj k (x k ); if a function / is defined on some 
subset of (R d xM) k , we write f(t k ,m k ) to indicate the quantity f(4> k 1 (t k ,m k )) (and an 
analogous convention holds for functions defined on a subset of (M d ) fc x M k ). 
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The following statement shows how the norms of contractions are modified by a rescaling of 
the underlying kernels. 

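Proposition 4.2. Let $\alpha, \gamma, \gamma' > 0$, $h \in L^2(\alpha Z^k; \mu^k)$, $h' \in L^2(\alpha Z^q; \mu^q)$, where $1 \leq q \leq k$. Define

$$f(\mathbf{x}_k) = \gamma\, h(\alpha \mathbf{x}_k), \quad \mathbf{x}_k \in Z^k,$$
$$f'(\mathbf{x}_q) = \gamma'\, h'(\alpha \mathbf{x}_q), \quad \mathbf{x}_q \in Z^q.$$

Fix $1 \leq l \leq r \leq q \leq k$, and set $m = q + k - r - l$. For every $\lambda > 0$ one has that

$$\|f \star_r^l f'\|^2_{L^2(Z^m; \mu_\lambda^m)} = \gamma^2 (\gamma')^2 (\lambda \alpha^{-d})^{m+2l}\, \|h \star_r^l h'\|^2_{L^2(\alpha Z^m; \mu^m)}, \tag{4.18}$$

(the contractions $f \star_r^l f'$ and $h \star_r^l h'$ being realized, respectively, via $\mu_\lambda$ and $\mu$) and for $p \geq 1$,

$$\|f\|^p_{L^p(Z^k; \mu_\lambda^k)} = \gamma^p (\lambda \alpha^{-d})^{k}\, \|h\|^p_{L^p(\alpha Z^k; \mu^k)}. \tag{4.19}$$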
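Proof. The proof relies on a change of variables in (2.6), where all variables are multiplied by a
factor $\alpha$:

$$\|f \star_r^l f'\|^2_{L^2(Z^m; \mu_\lambda^m)}
= \gamma^2 (\gamma')^2 \lambda^{m+2l} \int_{Z^m} d\mu^m \int_{Z^{2l}} h(\alpha(\mathbf{x}_{k-r}, \mathbf{y}_{r-l}, \mathbf{z}_l))\, h(\alpha(\mathbf{x}_{k-r}, \mathbf{y}_{r-l}, \mathbf{z}'_l))\, h'(\alpha(\mathbf{x}'_{q-r}, \mathbf{y}_{r-l}, \mathbf{z}_l))\, h'(\alpha(\mathbf{x}'_{q-r}, \mathbf{y}_{r-l}, \mathbf{z}'_l))\, d\mu^{2l}$$

$$= \gamma^2 (\gamma')^2 \lambda^{m+2l} \int_{\alpha Z^{m+2l}} \alpha^{-d(m+2l)}\, d\mu^{m+2l}\, h(\mathbf{x}_{k-r}, \mathbf{y}_{r-l}, \mathbf{z}_l)\, h(\mathbf{x}_{k-r}, \mathbf{y}_{r-l}, \mathbf{z}'_l)\, h'(\mathbf{x}'_{q-r}, \mathbf{y}_{r-l}, \mathbf{z}_l)\, h'(\mathbf{x}'_{q-r}, \mathbf{y}_{r-l}, \mathbf{z}'_l)$$

$$= \gamma^2 (\gamma')^2 (\lambda \alpha^{-d})^{m+2l}\, \|h \star_r^l h'\|^2_{L^2(\alpha Z^m; \mu^m)}.$$

Formula (4.19) is proved by the same route. □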
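
The scaling identity (4.19) can be checked numerically in the simplest setting. The sketch below is only an illustration under stated assumptions: $d = 1$, $k = 1$, no marks, $X = [-1, 1]$, $\mu$ equal to the Lebesgue measure, and a Gaussian-shaped kernel `h` chosen for convenience (none of these choices come from the paper). The helper names are hypothetical.

```python
import math

def midpoint_integral(func, a, b, n=20000):
    """Midpoint-rule approximation of the integral of func over [a, b]."""
    step = (b - a) / n
    return step * sum(func(a + (i + 0.5) * step) for i in range(n))

def rescaled_norm_sides(h, gamma, alpha, lam, p):
    """Return (lhs, rhs) of identity (4.19) for d = 1, k = 1, X = [-1, 1]."""
    f = lambda x: gamma * h(alpha * x)
    # Left side: p-th power of the L^p norm of f on Z = X under mu_lambda = lam * Lebesgue.
    lhs = lam * midpoint_integral(lambda x: abs(f(x)) ** p, -1.0, 1.0)
    # Right side: gamma^p * (lam * alpha^{-d}) * p-th power of the L^p norm of h on alpha * X.
    rhs = gamma ** p * (lam / alpha) * midpoint_integral(
        lambda y: abs(h(y)) ** p, -alpha, alpha
    )
    return lhs, rhs

# Illustrative kernel (an assumption, not taken from the paper).
h = lambda y: math.exp(-y * y)
lhs, rhs = rescaled_norm_sides(h, gamma=1.5, alpha=3.0, lam=7.0, p=4)
print(lhs, rhs)
```

Both sides agree up to floating-point error, because the substitution $y = \alpha x$ maps the quadrature nodes of one integral onto those of the other.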

4.2 Stationary kernels 

The estimates of this section will be used to study the RHS of (4.18) when $\alpha = \alpha_\lambda$ depends on
$\lambda$ and $\alpha_\lambda \to \infty$, assuming that $h$ and $h'$ are invariant in the sense described below. Note that,
if $\alpha_\lambda \to \infty$ and since $X$ has $0$ in its interior, with our notation one has that

$$Z_\infty = \bigcup_{\lambda > 0} \alpha_\lambda Z = \bigcup_{\lambda > 0} Z_\lambda = \mathbb{R}^d \times M.$$

We say that a function $h$ defined on $Z_\infty^k$ is invariant under translations, or stationary, if

$$h(\mathbf{t}_k, \mathbf{m}_k) = h(\mathbf{t}_k + t, \mathbf{m}_k)$$

for every $t$ in $\mathbb{R}^d$ and every $(\mathbf{t}_k, \mathbf{m}_k) \in Z_\infty^k$. This property implies that, if $\mathbf{t}_k = (t_1, \ldots, t_k)$,

$$h(\mathbf{t}_k, \mathbf{m}_k) = h(0, \mathbf{t}_{k-1} - t_1, \mathbf{m}_k) = \bar{h}(\mathbf{t}_{k-1} - t_1, \mathbf{m}_k), \tag{4.20}$$

where $\bar{h} : (\mathbb{R}^d)^{k-1} \times M^k \to \mathbb{R}$ is given by $\bar{h}(\mathbf{s}_{k-1}, \mathbf{m}_k) = h(0, \mathbf{s}_{k-1}, \mathbf{m}_k)$ and, according to our
conventions, $\mathbf{t}_{k-1} = (t_2, \ldots, t_k)$. As usual, we identify the space $(\mathbb{R}^d)^0$ with a one-point set:
this is consistent with the fact that a function on $\mathbb{R}^d$ is stationary if and only if it is constant.
Note that, in the previous formula (4.20), the choice of the variable $t_1$ among the variables $t_i$,
$i = 1, \ldots, k$, is immaterial whenever $h$ is symmetric in its $k$ variables. Given $\mathbf{x}_k = (\mathbf{t}_k, \mathbf{m}_k) \in Z_\infty^k$
and $t \in \mathbb{R}^d$ (that is, $t$ is a spatial variable), we shall write, by a slight abuse of notation,
$\mathbf{x}_k + t = (\mathbf{t}_k + t, \mathbf{m}_k)$: for instance, with this notation a function $h$ on $Z_\infty^k$ is stationary if and
only if $h(\mathbf{x}_k) = h(\mathbf{x}_k + t)$ for every $t \in \mathbb{R}^d$ and every $\mathbf{x}_k \in Z_\infty^k$.
Example 4.3. Assume $M = \emptyset$. Then, a stationary symmetric kernel is given by

$$h(t_1, \ldots, t_k) = g(\|t_i - t_j\|_{\mathbb{R}^d} : 1 \leq i < j \leq k),$$

where $g : \mathbb{R}_+^{k(k-1)/2} \to \mathbb{R}$ is some symmetric mapping. For instance, if $g(a_1, \ldots, a_{k(k-1)/2}) =
\prod_j \mathbf{1}_{\{a_j \leq \delta\}}$, then $h(t_1, \ldots, t_k)$ equals 1 if the distance between every pair of coordinates
of the vector $(t_1, \ldots, t_k)$ does not exceed $\delta$, and 0 otherwise. Kernels of this type appear in Section 3 in
the framework of subgraph counting, as well as in Section 8 where we deal with random radii
of interaction.
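
As a concrete illustration of Example 4.3, the following minimal sketch implements the indicator kernel that equals 1 when all pairwise distances are at most $\delta$, and checks on a few points that it is invariant under a common translation and under permutation of its arguments. The function names `clique_kernel` and `translate`, and the sample points, are hypothetical and chosen only for the example.

```python
import itertools
import math

def clique_kernel(points, delta):
    """Return 1 if every pair of points is within Euclidean distance delta, else 0."""
    close = all(math.dist(a, b) <= delta
                for a, b in itertools.combinations(points, 2))
    return 1 if close else 0

def translate(points, shift):
    """Add the same spatial shift to every point (marks are absent here)."""
    return [tuple(c + s for c, s in zip(p, shift)) for p in points]

pts = [(0.0, 0.0), (0.3, 0.4), (0.5, 0.0)]   # three points in the plane
shifted = translate(pts, (10.0, -3.0))

# Stationarity: the kernel value is unchanged by a common translation.
print(clique_kernel(pts, 0.6), clique_kernel(shifted, 0.6))
# Symmetry: the kernel value does not depend on the order of the points.
print({clique_kernel(list(perm), 0.6) for perm in itertools.permutations(pts)})
```

The kernel depends on the points only through their pairwise differences, which is exactly what makes it stationary in the sense of (4.20).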

Remark 4.4. For technical purposes we also define, for $X, X' \subset \mathbb{R}^d$,

$$X + X' = \{t + t' : t \in X,\ t' \in X'\}, \qquad \hat{X} = X - X = X + (-X),$$

and let $\check{X}$ be the largest symmetric set such that $\check{X} - \check{X} \subset X$. Remark that if $t_j \in X$ for every $j$,
then $t_j - t_1 \in \hat{X}$, for every $1 \leq j \leq k$. Also, if each $t_j \in \check{X}$ ($j \geq 1$), then one can exploit the
symmetry of $\check{X}$ and deduce that $(t_1, \ldots, t_k) = (s_1, s_2 - s_1, \ldots, s_k - s_1)$, for some $\mathbf{s}_k = (s_1, \ldots, s_k) \in X^k$.
It follows that if $\varphi$ denotes the change of variables $\varphi(\mathbf{t}_k) = (t_1, t_2 - t_1, \ldots, t_k - t_1)$, then
one has that $\check{X}^k \subset \varphi(X^k) \subset X \times \hat{X}^{k-1}$ (this will be useful in further changes of variables). We
also set $\hat{Z}^q = \hat{X}^q \times M^q$ and $\check{Z}^q = \check{X}^q \times M^q$.

Remark 4.5. (i) In (4.20), $\bar{h}$ is a function whose argument has $k - 1$ spatial variables and
$k$ mark variables. To deal with this situation, we use the underlined symbol $\underline{\mathbf{x}}_{k-1} =
(\mathbf{t}_{k-1}, \mathbf{m}_k)$ to indicate vectors such that the number of mark variables is one plus the
number of spatial variables. Accordingly, we write $\underline{Z}^q = X^q \times M^{q+1}$, $q \geq 0$, to indicate
the collection of all such vectors, with one additional mark variable in the product space. Write also $\underline{\hat{Z}}^q = \hat{X}^q \times
M^{q+1}$, $q \geq 0$, $\underline{Z}_\infty^q = (\mathbb{R}^d)^q \times M^{q+1}$, $\underline{\mu}^q = \ell^q \otimes \nu^{q+1}$. At first reading, one can consider the
simple framework where $M = \emptyset$, in which case we have simply $\underline{\mathbf{x}}_q = \mathbf{x}_q = \mathbf{t}_q$, $\underline{Z}^q = X^q$, $q \geq
1$.

(ii) Let $A \subset \mathbb{R}^d$. Analogously to the convention introduced in Remark 4.1-(d), in what follows
we shall identify the sets $A^q \times M^{q+1}$ and $(A \times M)^i \times M \times (A \times M)^{q-i}$ by tacitly applying
the canonical bijection between them (the chosen index $i \in \{1, \ldots, q\}$ will be always clear
from the context).

Let us introduce some further notation, which is required in order to express our main
bounds. For the rest of the section we fix a probability density $\kappa(t)$ on $\mathbb{R}^d$, such that $0 < \kappa \leq 1$.

Remark 4.6. The estimates proved in this section continue to hold (up to some multiplicative
constant) whenever the density $\kappa$ is bounded from above by some constant $M > 0$. The value
$M = 1$ has been chosen in order to simplify some of the formulae to follow. Note that, in the
applications developed in Section 8, the upper bound $M$ is sometimes taken to
be different from 1.

For $s \geq 1$ and $\mathbf{t}_s = (t_1, \ldots, t_s) \in (\mathbb{R}^d)^s$, we set

$$\kappa_s(\mathbf{t}_s) = \kappa(t_1) \cdots \kappa(t_s).$$

Plainly, $\kappa_s$ is a probability density on $(\mathbb{R}^d)^s$, satisfying $0 < \kappa_{s+1}(\mathbf{t}_{s+1}) \leq \kappa_s(\mathbf{t}_s) \leq 1$ for every
$\mathbf{t}_s \in (\mathbb{R}^d)^s$ and $\mathbf{t}_{s+1} = (\mathbf{t}_s, t)$, with $t \in \mathbb{R}^d$. For a measurable non-negative function $h$ on
$\underline{Z}_\infty^{k-1}$, $k \geq 1$, define for $p = 2, 4$

$$A_{\kappa,p}(h) = \int_{\underline{Z}_\infty^{k-1}} \kappa_{k-1}(\mathbf{t}_{k-1})^{-1}\, h(\underline{\mathbf{x}}_{k-1})^p\, d\underline{\mu}^{k-1}, \tag{4.21}$$

where $\mathbf{t}_{k-1}$ stands for the spatial variables of $\underline{\mathbf{x}}_{k-1} = (\mathbf{t}_{k-1}, \mathbf{m}_k)$, $\mathbf{m}_k \in M^k$. We will simply
write $A_p$, whenever the density $\kappa$ is unambiguously defined (as is the case for the rest of the
section). The following lemma is fundamental in this article: it gives estimates for the norms
and the contractions of stationary functions restricted to growing bounded domains.

Remark 4.7. Clearly, the quantities $A_{\kappa,2}(h)$ and $A_{\kappa,4}(h)$ defined above can be infinite. As
shown in the applications developed later in the paper, the subtle point is, given a kernel $h$, to
find a density $\kappa$ such that $A_{\kappa,2}(h), A_{\kappa,4}(h) < \infty$, in such a way that the bounds appearing in the
forthcoming Lemma 4.9 are finite. Often, $\kappa(x) = C(1 + \|x\|)^a$ for $C > 0$, $a < 0$ well chosen, but
if $h$ presents a specific anisotropic behaviour, a more adapted density can be chosen.
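
To make Remark 4.7 concrete, here is a minimal numerical sketch under explicit assumptions: it reads (4.21) as the integral of $h^p$ against $\kappa^{-1}$, takes $d = 1$, $k = 2$, no marks, the indicator kernel $\bar{h}(s) = \mathbf{1}_{\{|s| \leq \delta\}}$, and the polynomial density $\kappa(t) = \tfrac12 (1 + |t|)^{-2}$ (which integrates to 1 and is bounded by 1). These choices and the function names are illustrative only. In this case $A_{\kappa,p}(\bar h) = \int_{-\delta}^{\delta} 2(1 + |s|)^2\, ds = \tfrac43\left((1 + \delta)^3 - 1\right)$ for every $p$.

```python
def kappa(t):
    """Polynomial probability density on the real line, bounded by 1/2."""
    return 0.5 * (1.0 + abs(t)) ** -2

def a_kappa_p_indicator(delta, p, n=20000):
    """Midpoint-rule value of the integral of h_bar(s)^p / kappa(s) over [-delta, delta],
    where h_bar is the indicator of [-delta, delta] (so h_bar^p = 1 on the domain)."""
    step = 2.0 * delta / n
    total = 0.0
    for i in range(n):
        s = -delta + (i + 0.5) * step
        total += (1.0 ** p) / kappa(s)
    return step * total

def closed_form(delta):
    """Exact value (4/3) * ((1 + delta)^3 - 1) of the same integral."""
    return 4.0 * ((1.0 + delta) ** 3 - 1.0) / 3.0

print(a_kappa_p_indicator(1.0, 2), closed_form(1.0))
```

A compactly supported kernel makes the integral finite for any positive density; the choice of $\kappa$ only becomes delicate when the kernel has unbounded support.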



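The following notation is borrowed from [10].

Remark 4.8. (Asymptotic equivalence notation) Given two mappings $\lambda \mapsto \gamma_\lambda$, $\lambda \mapsto \delta_\lambda$, we
write $\gamma_\lambda \asymp \delta_\lambda$ if there are two constants $C, C' > 0$ such that $C\gamma_\lambda \leq \delta_\lambda \leq C'\gamma_\lambda$ for $\lambda$ sufficiently
large. We write $\gamma_\lambda \sim \delta_\lambda$ if $\delta_\lambda > 0$ for $\lambda$ sufficiently large and $\gamma_\lambda / \delta_\lambda \to 1$.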

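Lemma 4.9. Let $h, g$ be symmetric non-negative stationary functions on $Z_\infty^k$ and $Z_\infty^q$ respectively,
$1 \leq q \leq k$. Let $\bar{h}, \bar{g}$ be the kernels defined by (4.20), and assume that $A_{\kappa,p}(\bar{h}) = A_p(\bar{h}) < \infty$
and $A_{\kappa,p}(\bar{g}) = A_p(\bar{g}) < \infty$ for some probability density $\kappa \in (0, 1]$. Then, for $1 \leq l \leq r \leq q \leq k$
such that $l \leq k - 1$, setting $m = k + q - r - l$,

$$\|h \star_r^l g\|^2_{L^2(Z^m; \mu^m)} \leq \|h \star_r^l g\|^2_{L^2(Z \times Z_\infty^{m-1}; \mu^m)} \leq
\begin{cases}
\ell(X) \sqrt{A_4(\bar{h})\, A_4(\bar{g})}, & \text{if } r > l, \\
\ell(X) \sqrt{A_2(\bar{h})\, A_4(\bar{h})\, A_2(\bar{g})\, A_4(\bar{g})}, & \text{if } r = l,
\end{cases} \tag{4.22}$$

and for $p = 2, 4$,

$$\|h\|^p_{L^p(\lambda Z^k)} \sim \lambda^d\, \ell(X)\, \|\bar{h}\|^p_{L^p(\underline{Z}_\infty^{k-1})} \quad \text{as } \lambda \to \infty. \tag{4.23}$$

Note that the contractions appearing in formula (4.22) are realized with respect to the measure
$\mu$. Concerning the middle terms of the two inequalities (4.22), the symbol $Z \times Z_\infty^{m-1}$ has to
be interpreted as follows: in the norm involving the contraction $\star_r^l$ with $r > l$, one of the variables in the
argument of $h$ is integrated over $Z$, while the others are integrated over $Z_\infty$; in the norm involving
the contraction $\star_l^l$, one of the variables in common between $h$ and $g$ is integrated over $Z$, while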
the others are integrated over Z^. A similar convention holds for the middle term of inequality 

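Proof. Let us start by proving (4.23). One has

$$\|h\|^p_{L^p(\lambda Z^k)} = \int_{\lambda Z^k} h(\mathbf{x}_k)^p\, d\mathbf{x}_k = \int_{\lambda X \times \underline{\lambda Z}^{k-1}} \bar{h}(\underline{\mathbf{x}}_{k-1} - t_1)^p\, dt_1\, d\underline{\mathbf{x}}_{k-1}
= \lambda^d \int_{X \times \underline{\lambda Z}^{k-1}} \bar{h}(\underline{\mathbf{x}}_{k-1} - \lambda t_1)^p\, dt_1\, d\underline{\mathbf{x}}_{k-1}.$$

For every $t_1 \in X$ not on the boundary (i.e. almost every $t_1$ for Lebesgue measure), since $X$ is
symmetric, $X + t_1$ contains $0$ in its interior, whence $\mathbf{1}_{\lambda(X + t_1)}$ converges $\ell$-a.e. to $\mathbf{1}_{\mathbb{R}^d}$. It yields
by Lebesgue's dominated convergence theorem on $X$ that

$$\|h\|^p_{L^p(\lambda Z^k)} \sim \lambda^d\, \ell(X) \int_{\underline{Z}_\infty^{k-1}} \bar{h}(\underline{\mathbf{x}}_{k-1})^p\, d\underline{\mathbf{x}}_{k-1},$$

with the domination

$$\int_{\underline{\lambda Z}^{k-1}} \bar{h}(\underline{\mathbf{x}}_{k-1} - \lambda t_1)^p\, d\underline{\mathbf{x}}_{k-1} \leq \int_{\underline{Z}_\infty^{k-1}} \bar{h}(\underline{\mathbf{x}}_{k-1})^p\, d\underline{\mathbf{x}}_{k-1} < \infty.$$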



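The first inequality in each row of (4.22) is a trivial consequence of the fact that $h$ and $g$ are
non-negative. Consider first the case $r - l > 0$. We have

$$\|h \star_r^l g\|^2_{L^2(Z \times Z_\infty^{m-1})} = \int h(\mathbf{x}_{k-r}, \mathbf{y}_{r-l}, \mathbf{z}_l)\, g(\mathbf{x}'_{q-r}, \mathbf{y}_{r-l}, \mathbf{z}_l)\, h(\mathbf{x}_{k-r}, \mathbf{y}_{r-l}, \mathbf{z}'_l)\, g(\mathbf{x}'_{q-r}, \mathbf{y}_{r-l}, \mathbf{z}'_l)\, d\mu^{m+2l},$$

where we assume that the integration over $Z$ is performed on the variable $y_1$. Call $t_1$ the
spatial component of $y_1$. Since $t_1$ appears in the argument of each of the four kernels, we can
introduce the change of variables such that $t_1$ is subtracted from every other coordinate, that
is: $t_1 \mapsto t_1$, $\mathbf{x}_{k-r} - t_1 \mapsto \mathbf{x}_{k-r}$, $\mathbf{x}'_{q-r} - t_1 \mapsto \mathbf{x}'_{q-r}$, $\mathbf{y}_{r-l-1} - t_1 \mapsto \mathbf{y}_{r-l-1}$, $\mathbf{z}_l - t_1 \mapsto \mathbf{z}_l$, $\mathbf{z}'_l - t_1 \mapsto \mathbf{z}'_l$.
This yields

$$\|h \star_r^l g\|^2_{L^2(Z \times Z_\infty^{m-1})} \leq \ell(X) \int \bar{h}(\mathbf{x}_{k-r}, \underline{\mathbf{y}}_{r-l-1}, \mathbf{z}_l)\, \bar{g}(\mathbf{x}'_{q-r}, \underline{\mathbf{y}}_{r-l-1}, \mathbf{z}_l)\, \bar{h}(\mathbf{x}_{k-r}, \underline{\mathbf{y}}_{r-l-1}, \mathbf{z}'_l)\, \bar{g}(\mathbf{x}'_{q-r}, \underline{\mathbf{y}}_{r-l-1}, \mathbf{z}'_l)\, d\underline{\mu}^{m+2l-1},$$

where we have used the positivity and stationarity of $h, g$. Now we apply the Cauchy-Schwarz
inequality (first in the variables $\mathbf{x}_{k-r}, \mathbf{x}'_{q-r}, \mathbf{z}_l, \mathbf{z}'_l$, then in $\underline{\mathbf{y}}_{r-l-1}$) to deduce that

$$\|h \star_r^l g\|^2_{L^2(Z \times Z_\infty^{m-1})} \leq \ell(X) \sqrt{\int d\underline{\mu}^{r-l-1} \left( \int d\mu^{k-r+l}\, \bar{h}^2(\mathbf{x}_{k-r}, \underline{\mathbf{y}}_{r-l-1}, \mathbf{z}_l) \right)^2}\ \sqrt{\int d\underline{\mu}^{r-l-1} \left( \int d\mu^{q-r+l}\, \bar{g}^2(\mathbf{x}'_{q-r}, \underline{\mathbf{y}}_{r-l-1}, \mathbf{z}_l) \right)^2}
=: \ell(X) \sqrt{I_1} \sqrt{I_2},$$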
where the definitions of $I_1$ and $I_2$ are obvious from the context. Now, calling $\mathbf{t}_{k-r+l}$ the $k - r$
spatial variables of $\mathbf{x}_{k-r}$ concatenated with the $l$ spatial variables of $\mathbf{z}_l$, we have

$$I_1 = \int d\underline{\mu}^{r-l-1} \left( \int d\mu^{k-r+l}\, \kappa_{k-r+l}(\mathbf{t}_{k-r+l})\, \kappa_{k-r+l}(\mathbf{t}_{k-r+l})^{-1}\, \bar{h}^2(\mathbf{x}_{k-r}, \underline{\mathbf{y}}_{r-l-1}, \mathbf{z}_l) \right)^2$$

$$\leq \int d\underline{\mu}^{r-l-1} \int d\mu^{k-r+l}\, \kappa_{k-r+l}(\mathbf{t}_{k-r+l})\, \kappa_{k-r+l}(\mathbf{t}_{k-r+l})^{-2}\, \bar{h}^4(\mathbf{x}_{k-r}, \underline{\mathbf{y}}_{r-l-1}, \mathbf{z}_l) \tag{4.24}$$

$$\leq A_4(\bar{h}),$$

where we have used the Cauchy-Schwarz inequality in the probability space

$$\left(Z_\infty^{k-r+l},\ \kappa_{k-r+l}(\mathbf{t}_{k-r+l})\, d\mu^{k-r+l}\right).$$

Performing similar computations for $I_2$, it follows that $\|h \star_r^l g\|^2_{L^2(Z \times Z_\infty^{m-1})} \leq \ell(X) \sqrt{A_4(\bar{h})\, A_4(\bar{g})}$.



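We now consider the more difficult case $r = l \geq 1$, $r < k$. For $\mathbf{z}_r, \mathbf{z}'_r \in (\mathbb{R}^d \times M)^r$, denote resp. by
$t_1 \in \mathbb{R}^d$ and $t_1' \in \mathbb{R}^d$ the spatial variables of $z_1$ and $z_1'$ (meaning that $z_1 = (t_1, m_1)$, $z_1' = (t_1', m_1')$
for some $m_1, m_1' \in M$). Then

$$\|h \star_r^r g\|^2_{L^2(Z \times Z_\infty^{m-1})} = \int_{Z \times Z_\infty^{k+q-2r-1}} d\mu^{k+q-2r} \left( \int_{Z_\infty^r} d\mu^r\, h(\mathbf{x}_{k-r}, \mathbf{z}_r)\, g(\mathbf{x}'_{q-r}, \mathbf{z}_r) \right)^2$$

$$= \int d\mu^{k+q}\, h(\mathbf{x}_{k-r}, \mathbf{z}_r)\, g(\mathbf{x}'_{q-r}, \mathbf{z}_r)\, h(\mathbf{x}_{k-r}, \mathbf{z}'_r)\, g(\mathbf{x}'_{q-r}, \mathbf{z}'_r)$$

$$= \int d\mu^{k+q}\, \bar{h}(\mathbf{x}_{k-r} - t_1, \mathbf{z}_{r-1} - t_1)\, \bar{g}(\mathbf{x}'_{q-r} - t_1' + t_1' - t_1, \mathbf{z}_{r-1} - t_1)\, \bar{h}(\mathbf{x}_{k-r} - t_1 + t_1 - t_1', \mathbf{z}'_{r-1} - t_1')\, \bar{g}(\mathbf{x}'_{q-r} - t_1', \mathbf{z}'_{r-1} - t_1'),$$

where $x_1 = (t_1, m_1)$ is the variable restricted to $Z$, and we make the change of variables

$$\varphi :\quad t_1 \mapsto t_1, \quad \mathbf{x}_{k-r} - t_1 \mapsto \mathbf{x}_{k-r}, \quad \mathbf{x}'_{q-r} - t_1' \mapsto \mathbf{x}'_{q-r}, \quad t_1' - t_1 \mapsto t_1', \quad \mathbf{z}_{r-1} - t_1 \mapsto \mathbf{z}_{r-1}, \quad \mathbf{z}'_{r-1} - t_1' \mapsto \mathbf{z}'_{r-1},$$

whose Jacobian is a triangular matrix with unit diagonal, and $\varphi$ satisfies $\varphi(Z \times Z_\infty^{k+q-1}) \subset
X \times \underline{Z}_\infty^{k+q-1}$. One has that

$$\|h \star_r^r g\|^2_{L^2(Z \times Z_\infty^{m-1})} \leq \ell(X) \int d\underline{\mu}^{k+q-1}\, \bar{h}(\mathbf{x}_{k-r}, \mathbf{z}_{r-1})\, \bar{g}(\mathbf{x}'_{q-r} + t_1', \mathbf{z}_{r-1})\, \bar{h}(\mathbf{x}_{k-r} - t_1', \mathbf{z}'_{r-1})\, \bar{g}(\mathbf{x}'_{q-r}, \mathbf{z}'_{r-1}).$$

Applying Cauchy-Schwarz, and exploiting once again the positivity of the involved kernels, yields
the estimate

$$\|h \star_r^r g\|^2_{L^2(Z \times Z_\infty^{m-1})} \leq \ell(X) \sqrt{\int d\underline{\mu}^{k+q-1}\, \bar{h}(\mathbf{x}_{k-r}, \mathbf{z}_{r-1})\, \bar{g}^2(\mathbf{x}'_{q-r} + t_1', \mathbf{z}_{r-1})\, \bar{g}(\mathbf{x}'_{q-r}, \mathbf{z}'_{r-1})} \tag{4.25}$$

$$\times \sqrt{\int d\underline{\mu}^{k+q-1}\, \bar{h}(\mathbf{x}_{k-r}, \mathbf{z}_{r-1})\, \bar{h}^2(\mathbf{x}_{k-r} - t_1', \mathbf{z}'_{r-1})\, \bar{g}(\mathbf{x}'_{q-r}, \mathbf{z}'_{r-1})} \tag{4.26}$$

$$=: \ell(X) \sqrt{I_1} \sqrt{I_2}, \tag{4.27}$$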

where the definition of $I_1$ and $I_2$ is clear from the context. Assume first that $q > r$. Calling $s_1'$
the first spatial variable of $\mathbf{x}'_{q-r}$, the term under the first square root satisfies



h < 



2 j..fc-i 



si *' 



d^-r-ld^tis'l + *i,2d-r-l + i'l, Z,_l) 



79-2 



/ 



2 

dfj,i- r - 1 d\x r ~ x h (x fc _ r , i r _i)g 2 (si , x _ x , z^_ x ) , 



z z q ~ 2 



(4.28) 



where the double underscore means that we are integrating on a set where there are two more
mark variables than spatial variables. Now, make the changes of variables $\mathbf{x}'_{q-r-1} + t_1' \mapsto \mathbf{x}'_{q-r-1}$,
and then $s_1' + t_1' \mapsto t_1'$ in their respective integrals. We have



2 j..fc-i 



ii < y 

a' 1 ,t'„Xt-r^-,g(« li ) 2 X Z^ 1 



l>2r-l&?! 



9-2 



d/iff-»- 1 d/i'- 1 I 4 (f 1 ,2d_ r _ 1 ,z r _ 1 ) 



(4.29) 



r !/i (x fc _ r .,z r _ 1 )g 2 (si,x^_ r ._ 1 ,z^._ 1 ). 

— g — r — 1 '— r — 1 ^ ^oo 



We can now separate the integrals: writing $\mathbf{t}_{k-r}$ and $\mathbf{t}_{r-1}$ for the set of spatial coordinates of
$\mathbf{x}_{k-r}$ and $\mathbf{z}_{r-1}$, respectively, we deduce that

ii < / / / ^- r - l ^- l 5 4 (t'i,x;_ r _ 1 ,z f ._ 1 ) 

Jt'^md) \/ Vx'_ r _ 1) z r _ 1 e^r 2 ~~ 



/ \/ / ^-^(xfe-r.tr-i) 



< J / d^ift)" 1 / dM*- r - 1 ^ r - 1 ^(<i,4-r-l 5 2r-l) 

/ / d/jL k - r ^ k -r('tk-r)~ 1 / d^ r - 1 /l 2 (x fc _ r , Z^) 

// dfjT Kri^tr-i)- 1 f r'W.ii-r-l.i-l)^- 1 



< ^4(5)^2(^2(5) 

where we have used the Cauchy-Schwarz inequality in the probability spaces

$$(\mathbb{R}^d, \kappa(t)\, dt), \quad (Z_\infty^{k-r}, \kappa_{k-r}(\mathbf{t}_{k-r})\, d\mu^{k-r}), \quad (Z_\infty^{r-1}, \kappa_{r-1}(\mathbf{t}_{r-1})\, d\mu^{r-1}),$$

exactly as we did in (4.24). If $r = l = q < k$, there is no such variable as $s_1'$ in the integral
$I_1$, so the previous method does not work; still, there is an easier procedure. From (4.28) we
directly get, with obvious shorthand notation,



4// t{tr-i)J[ h 2 ( Xk - r ,z r ^ 1 )t(z' r _ 1 )^A 4 (g)A 2 (h)A 2 (g) 
Jt' 1 ,x k _ r ,z , r _ 1 y iz r _i y iz r _i 

Remark that the expression of $I_2$ can be obtained from that of $I_1$ by inverting "$h$" and "$g$",
"$t_1'$" and "$-t_1'$", "$x'$" and "$x$", "$z$" and "$z'$", and "$k$" and "$q$"; the fact that $q \leq k$ does not play any
role. Thus we get the desired estimates. □



Definition 4.10. In the light of the previous statement, we call rapidly decreasing function any
stationary function $h$ on $Z_\infty^k$ such that $A_{\kappa,p}(|\bar{h}|) < \infty$ for $p = 2, 4$ and some probability density
$0 < \kappa \leq 1$. This terminology refers to the shape of $h$ when $d = 2$: constant along diagonal
lines and uniformly decaying sufficiently fast far from the diagonal. Remark that since $\kappa \leq 1$,
given a rapidly decreasing function $h$, $\bar{h}$ has finite $L^2$ and $L^4$ norm on $\mathbb{R}^d$.

Lemma 4.9 is an expression of the fact that, when one computes the norm of a rapidly
decreasing function $h(x_1, \ldots, x_k)$ on a product space $Z_1 \times \cdots \times Z_k$, or of contractions of such
functions, the order of magnitude of the result depends only on the measure of the smallest $Z_i$.
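
The asymptotic relation (4.23) can be illustrated with a kernel for which everything is explicit. The sketch below assumes $d = 1$, $k = 2$, no marks, $X = [-1, 1]$ and the stationary kernel $h(t_1, t_2) = e^{-|t_1 - t_2|}$, so that $\bar{h}(s) = e^{-|s|}$; these choices and the function names are hypothetical. With $L = 2\lambda$, one has $\int_{[-\lambda,\lambda]^2} e^{-p|t_1 - t_2|}\, dt_1\, dt_2 = 2L/p - 2(1 - e^{-pL})/p^2$, while $\lambda^d\, \ell(X)\, \|\bar{h}\|_p^p = 2\lambda \cdot 2/p$.

```python
import math

def norm_on_square(lam, p):
    """Exact integral of exp(-p * |t1 - t2|) over the square [-lam, lam]^2."""
    side = 2.0 * lam
    return 2.0 * side / p - 2.0 * (1.0 - math.exp(-p * side)) / p ** 2

def limit_profile(lam, p):
    """lam^d * ell(X) * ||h_bar||_p^p for d = 1, X = [-1, 1], h_bar(s) = exp(-|s|)."""
    ell_x = 2.0                 # Lebesgue measure of X = [-1, 1]
    h_bar_norm = 2.0 / p        # integral of exp(-p |s|) over the real line
    return lam * ell_x * h_bar_norm

for lam in (1.0, 10.0, 100.0, 1000.0):
    ratio = norm_on_square(lam, 2) / limit_profile(lam, 2)
    print(lam, ratio)           # the ratio approaches 1 as lam grows
```

The ratio equals $1 - (1 - e^{-pL})/(pL)$, so the relative error decays like $1/\lambda$: the norm over the growing domain is governed by the volume $\lambda^d \ell(X)$ alone.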

Remark 4.11. A sufficient condition for a function $h$ to be rapidly decreasing is that

$$|\bar{h}(\mathbf{t}_{k-1}, \mathbf{m}_k)| \leq H(\mathbf{t}_{k-1}),$$

where $H$ is a bounded function not depending on $\mathbf{m}_k$ and such that $H$ converges to zero at a
subexponential speed as $\|\mathbf{t}_{k-1}\|_{(\mathbb{R}^d)^{k-1}} \to \infty$.
5 Asymptotic normality for finite Wiener-Itô expansions

N.B. Throughout this section, we apply the same conventions outlined in Remark 4.1 (a).

The analytical results of the previous section will now be used to deduce bounds on the
speed of convergence of a variable with a finite Wiener-Itô expansion, under some specific
scaling assumptions on the kernels. We will later apply these findings to U-statistics.



Theorem 5.1. Let $\alpha_\lambda > 0$ and $F_\lambda$ be of the form (2.8), where the multiple integrals are with
respect to a Poisson measure on $Z$ with control $\mu_\lambda = \lambda\mu$, and the kernels $f_{i,\lambda} \in L_s^2(Z^{q_i}, \mu_\lambda^{q_i})$ are
such that, for each $1 \leq i \leq k$, there is $\gamma_{i,\lambda} > 0$ and $h_{i,\lambda} \in L^2(Z_\lambda^{q_i}, \mu^{q_i})$ verifying

$$f_{i,\lambda}(\mathbf{x}_{q_i}) = \gamma_{i,\lambda}\, h_{i,\lambda}(\alpha_\lambda \mathbf{x}_{q_i}). \tag{5.30}$$

Assume that for each $1 \leq i \leq k$ there exists a nonnegative measurable function $h_i$ on $Z_\infty^{q_i}$ such
that

1. $h_i$ is a rapidly decreasing function not identically equal to 0,

2. $|h_{i,\lambda}| \leq h_i$ on $Z_\lambda^{q_i}$, for every $\lambda > 0$;

3 - \\K>^Lv{z^-^)^\\ h i\\ L v{z^-^i) as 00 for p = 2,A. 
Define

$$m_\lambda = \lambda \alpha_\lambda^{-d}.$$

Then

k 

Yar(F x ) = a 2 x ~ afj^ ^i^x INI^^-i^-i) ~ a A™7 t -A m !' 
i=i 



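$$B_3(\tilde{F}_\lambda; 1) \leq C \sqrt{u_\lambda + \omega_\lambda'}$$

with

$$u_\lambda := \frac{1}{\sigma_\lambda^4}\, \alpha_\lambda^d \max_{(*)} \left\{ (\gamma_{i,\lambda} \gamma_{j,\lambda})^2\, m_\lambda^{q_i + q_j - r + l} \right\}, \tag{5.31}$$

$$\omega_\lambda' := \frac{1}{\sigma_\lambda^4}\, \alpha_\lambda^d \max_{i = 1, \ldots, k} \left\{ \gamma_{i,\lambda}^4\, m_\lambda^{q_i} \right\}, \tag{5.32}$$

where $\max_{(*)}$ is defined as in (2.7). One also has the estimate in the Wasserstein distance:

$$d_W(\tilde{F}_\lambda, N) \leq C \sqrt{u_\lambda + \omega_\lambda'}.$$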

Proof. Using in sequence (4.19) and (4.23), one deduces the following estimates on the variance
k k 

<A = ^% ! ll/i,Alli 2( ^. M?) = /2^h m x WhxWl* (z^^i) 

i=l i=l 
k 

1=1 
Now, in $B_3(\tilde{F}_\lambda; 1)$, every kernel is normalized by $\sigma_\lambda$, whence every squared $L^2$ contraction norm
must be divided by $\sigma_\lambda^4$, as well as the 4th power of the $L^4$-norm:

Bs(F\: l) 2 ^ — r max II h \ -kl f~ \|| 2 , , „. + „._ r . H j max II fs illf ,.„„. 9i .. 

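Using (4.18) and (4.22), for each $i, j, r, l$ appearing in the argument of $\max_{(*)}$ we have

$$\|f_{i,\lambda} \star_r^l f_{j,\lambda}\|^2_{L^2(Z^{q_i + q_j - r - l};\ \mu_\lambda^{q_i + q_j - r - l})}
= (\gamma_{i,\lambda} \gamma_{j,\lambda})^2\, m_\lambda^{q_i + q_j - r + l}\, \|h_{i,\lambda} \star_r^l h_{j,\lambda}\|^2_{L^2(Z_\lambda^{q_i + q_j - r - l};\ \mu^{q_i + q_j - r - l})}
\leq C (\gamma_{i,\lambda} \gamma_{j,\lambda})^2\, m_\lambda^{q_i + q_j - r + l}\, \alpha_\lambda^d$$

because $|h_{i,\lambda}| \leq h_i$ and $h_i$ is a rapidly decreasing function. Turning to the $L^4$ norm, using again
(4.19) and (4.23) we have, for $1 \leq i \leq k$,

$$\|f_{i,\lambda}\|^4_{L^4(Z^{q_i};\ \mu_\lambda^{q_i})} = \gamma_{i,\lambda}^4\, m_\lambda^{q_i}\, \|h_{i,\lambda}\|^4_{L^4(Z_\lambda^{q_i};\ \mu^{q_i})} \leq C \gamma_{i,\lambda}^4\, m_\lambda^{q_i}\, \alpha_\lambda^d$$

because $\bar{h}_i \in L^4(\underline{Z}_\infty^{q_i - 1})$ (as already observed, this is an easy consequence of the assumption
$A_4(\bar{h}_i) < \infty$). All the bounds are easily deduced, and the estimate on the Wasserstein distance
follows from Theorem 2.4.

□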

One can use this result to obtain directly asymptotic normality for random variables having
a finite Wiener-Itô expansion. For variables with an infinite expansion, one can for instance use a
truncation argument - see e.g. [28]. In the sequel, the previous findings are applied in order to
deduce asymptotic normality for U-statistics.



6 U-statistics with a stationary rescaled kernel

N.B. In this section, the notation and framework of Problem [2JT] prevail. 

We assume that $F_\lambda$ is a square-integrable U-statistic of the form (2.12) with $h$ a rapidly
decreasing function on $Z_\infty^k$ (recall that a rapidly decreasing function is stationary by definition).
Recall that $\mu = \ell \otimes \nu$, where $\nu$ is a probability measure on the marks space $M$. For $1 \leq i \leq k$,
we set

$$|h|_i(\mathbf{x}_i) = \binom{k}{i} \int_{Z_\infty^{k-i}} |h|(\mathbf{x}_i, \mathbf{x}_{k-i})\, d\mu^{k-i}, \quad \mathbf{x}_i \in Z_\infty^i. \tag{6.33}$$

Note that the kernels $|h|_i$ are also stationary (just use the invariance of Lebesgue measure).
We say that $|h|$ has rapidly decreasing projections if the quantity $A_{\kappa,p}(\overline{|h|_i}) = A_p(\overline{|h|_i})$ (as defined
in (4.21)) is finite for some density $\kappa$, and every $1 \leq i \leq k$ and $p = 2, 4$. The following statement
is the main achievement of the present section.

Remark 6.1. It is easily seen that, if $h$ has rapidly decreasing projections, then the kernels
$h_i$, $i = 1, \ldots, k$, defined in (2.11) necessarily satisfy Point (1) and Point (2) in Assumption 2.2.
In order to apply our results, in what follows we shall also require that the kernels $|h|_i$ (and
therefore the kernels $h_i$) verify Assumption 2.2 (3): for instance, a sufficient condition for this
assumption to hold is that $|h| \leq H$, where the function $H$ only depends on the spatial variables
$\mathbf{t}_k$, is bounded and has compact support.

Theorem 6.2. In the framework of this section, assume that $|h|$ has rapidly decreasing projections,
and that the kernels $|h|_i(\mathbf{x}_i)$, $i = 1, \ldots, k$, verify Part (3) of Assumption 2.2. Define

$$m_\lambda = \lambda \alpha_\lambda^{-d}.$$

Then

$$\mathrm{Var}(F_\lambda) \asymp \alpha_\lambda^d\, m_\lambda^{2k-1} \max(1, m_\lambda^{-k+1}),$$

and

$$d_W(\tilde{F}_\lambda, N) \leq C \alpha_\lambda^{-d/2} \sqrt{\max(1, m_\lambda^{-k}, m_\lambda^{-2(k-1)})}$$

for some $C > 0$.

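Proof. According to (2.11), the kernels $f_{i,\lambda}$ of $F_\lambda$'s Wiener-Itô decomposition are given by: for
$1 \leq i \leq k$,

$$f_{i,\lambda}(\mathbf{x}_i) = \binom{k}{i} \int_{Z^{k-i}} h(\alpha_\lambda(\mathbf{x}_i, \mathbf{x}_{k-i}))\, d\mu_\lambda^{k-i} = \binom{k}{i} (\lambda \alpha_\lambda^{-d})^{k-i} \int_{\alpha_\lambda Z^{k-i}} h(\alpha_\lambda \mathbf{x}_i, \mathbf{x}_{k-i})\, d\mu^{k-i},$$

whence $f_{i,\lambda}$ is of the form (5.30), with

$$\gamma_{i,\lambda} = m_\lambda^{k-i}, \qquad h_{i,\lambda}(\mathbf{x}_i) = \binom{k}{i} \int_{\alpha_\lambda Z^{k-i}} h(\mathbf{x}_i, \mathbf{x}_{k-i})\, d\mu^{k-i}.$$

Our purpose is now to verify Point 1 and Point 2 in Theorem 5.1 with $h_i = |h|_i$, as defined in
(6.33). Point 1 is an immediate consequence of the estimates

$$|h_{i,\lambda}(\mathbf{x}_i)| \leq \binom{k}{i} \int_{Z_\infty^{k-i}} |h|(\mathbf{x}_i, \mathbf{x}_{k-i})\, d\mu^{k-i} = |h|_i(\mathbf{x}_i).$$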



To prove Point 2 



• /Z A JZ \ JZ \ q=l 



S * JZ A ' g=l 

where Zj is decomposed in (zi,Zj_i) and the variable Zi has been subtracted to every other 
variable present in the integral, exploiting the stationarity of h. Writing X x = ot\X, we now 
make the change of variables 

V9(2:i,Zj_i,Z fc _ i , . . . ,Z fe _J — (Zi,Zj_i - ^i,z fc _ i - Zl, . . . ,z fc _ i - Zij, 

verifying 

X x x ^- 1 +P( fc -0 c ( p(Z^- 1 ^ k - i) ) QX x x z l ~ 1+p{k - l \ 
we have

v 



A g=l 

Since < £(-X") < oo, one has that £(X\) >c l^{Z\) X and for A sufficiently large 



Ai— — i) 

Z A g=l 



(<?) \A,,i+<l{k-i) 



Both sides converge to \\hi\\ p TV , 7 i . i% ^ A p {hi) < oo by hypothesis. We have with similar 
computations 



P 

(9) AJ.,i+<7(fc-i) 



J A ^^oo ,j = l 

and both sides converge to H^iH^p^; iy whence we have indeed H/i^aII^p ~ II^IIlp ~ ^aII^Hlp' 
and the Point 2 is verified. Using the notation of Th. 15.11 we have that (|5.3ip is verified, and 
for some constant C > 



Var(F A ) x x max (m\m^ %} ) x afm x max (m A l * ') x a^mf max(l 



i=l,. ..,fc i=l,...,k 

and 



> m A 



where, writing a\ = Var(.F\)> 

^A : = -4«A m ax7r,A7j,A m A 



1 d 2 2 i+J— r+J 



A 

1 d 2(2k-i-j) i+j-r+l 

— ajmaxm A m A 

-d -2(2fc-l) 4fe max i( m r -j_r+ ^ 



A max(l,m7 fe+1 ^ 2 



A 

a _d m2 max(m- 3fc+1 ,m- 2 ) 
A A max(l,m^ +1 ) 2 
_ d flifm A ^l 

a A X S -2(fc-l) . f . n 

m x II TTlA < 1 



and 
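$$\omega_\lambda' := \frac{1}{\sigma_\lambda^4} \max_{i = 1, \ldots, k} \gamma_{i,\lambda}^4\, m_\lambda^{i}\, \alpha_\lambda^d$$

$$\asymp \alpha_\lambda^{-2d}\, m_\lambda^{-2(2k-1)} \max(1, m_\lambda^{1-k})^{-2} \max_{i = 1, \ldots, k} m_\lambda^{4(k-i)}\, m_\lambda^{i}\, \alpha_\lambda^d$$

$$= \alpha_\lambda^{-d}\, m_\lambda^{-2(2k-1)}\, m_\lambda^{4k} \max(1, m_\lambda^{1-k})^{-2}\, m_\lambda^{-3} \max_{i = 1, \ldots, k} m_\lambda^{-3(i-1)}$$

$$= \alpha_\lambda^{-d}\, m_\lambda^{-1} \max(1, m_\lambda^{1-k})^{-2} \max(1, m_\lambda^{-3(k-1)})$$

$$= \alpha_\lambda^{-d} \begin{cases} m_\lambda^{-1} & \text{if } m_\lambda > 1, \\ m_\lambda^{-k} & \text{if } m_\lambda \leq 1. \end{cases}$$

Taking the overall maximum yields

$$d_W(\tilde{F}_\lambda, N) \leq C \alpha_\lambda^{-d/2} \sqrt{\max(1, m_\lambda^{-k}, m_\lambda^{-2(k-1)})}.$$

□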
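
The variance order in Theorem 6.2 can be read off from Theorem 5.1 by a small computation, sketched here under the assumption (taken from the proof above) that the $i$-th kernel has order $q_i = i$ and scaling factor $\gamma_{i,\lambda} = m_\lambda^{k-i}$. Up to the factor $\alpha_\lambda^d$, the variance is then of order $\max_i \gamma_{i,\lambda}^2 m_\lambda^{i} = \max_i m_\lambda^{2k-i}$, which should coincide with $m_\lambda^{2k-1} \max(1, m_\lambda^{1-k})$. The function names are hypothetical.

```python
import math

def variance_order_from_chaoses(m, k):
    """max over i = 1..k of gamma_i^2 * m^i with gamma_i = m^(k - i)."""
    return max((m ** (k - i)) ** 2 * m ** i for i in range(1, k + 1))

def variance_order_closed_form(m, k):
    """Closed form m^(2k - 1) * max(1, m^(1 - k)) appearing in Theorem 6.2."""
    return m ** (2 * k - 1) * max(1.0, m ** (1 - k))

for k in (1, 2, 3, 4):
    for m in (0.1, 0.5, 1.0, 2.0, 10.0):
        a = variance_order_from_chaoses(m, k)
        b = variance_order_closed_form(m, k)
        print(k, m, a, b)
```

For $m_\lambda \geq 1$ the first chaos ($i = 1$) dominates, while for $m_\lambda < 1$ the last one ($i = k$) does, which is the dichotomy encoded by the factor $\max(1, m_\lambda^{1-k})$.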

The condition that $h$ has rapidly decreasing projections seems a bit abstract, so we give
sufficient conditions below. The symbol $\kappa$ denotes a fixed probability density taking values in
(0,1].

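Lemma 6.3. Let $h$ be a non-negative function on $\underline{Z}_\infty^{k-1}$, $k \geq 1$. Define for $p = 2, 4$

$$A_p'(h) = \int_{\underline{Z}_\infty^{k-1}} h^p(\underline{\mathbf{x}}_{k-1})\, \kappa_{k-1}(\mathbf{t}_{k-1})^{1-p}\, d\underline{\mu}^{k-1}.$$

Every projection

$$h_j(\underline{\mathbf{x}}_{j-1}) = \binom{k}{j} \int_{Z_\infty^{k-j}} h(\underline{\mathbf{x}}_{j-1}, \mathbf{x}_{k-j})\, d\mu^{k-j}, \quad 1 \leq j \leq k,\ \underline{\mathbf{x}}_{j-1} \in \underline{Z}_\infty^{j-1},$$

for $1 \leq j \leq k$ satisfies

$$A_p(h_j) \leq A_p'(h), \quad p = 2, 4. \tag{6.34}$$

Proof. Hölder's inequality yields, for $p = 2, 4$, $\underline{\mathbf{x}}_{j-1} \in \underline{Z}_\infty^{j-1}$,

$$h_j(\underline{\mathbf{x}}_{j-1})^p \leq \int h^p(\underline{\mathbf{x}}_{j-1}, \mathbf{x}_{k-j})\, \kappa_{k-j}(\mathbf{t}_{k-j})^{-p}\, \kappa_{k-j}(\mathbf{t}_{k-j})\, d\mu^{k-j}
= \int h^p(\underline{\mathbf{x}}_{j-1}, \mathbf{x}_{k-j})\, \kappa_{k-j}(\mathbf{t}_{k-j})^{1-p}\, d\mu^{k-j},$$

whence

$$A_p(h_j) = \int_{\underline{Z}_\infty^{j-1}} \kappa_{j-1}(\mathbf{t}_{j-1})^{-1}\, h_j^p(\underline{\mathbf{x}}_{j-1})\, d\underline{\mu}^{j-1}$$

$$\leq \int_{\underline{Z}_\infty^{k-1}} \kappa_{j-1}(\mathbf{t}_{j-1})^{-1}\, \kappa_{k-j}(\mathbf{t}_{k-j})^{1-p}\, h^p(\underline{\mathbf{x}}_{k-1})\, d\underline{\mu}^{k-1}$$

$$\leq \int_{\underline{Z}_\infty^{k-1}} \kappa_{k-1}(\mathbf{t}_{k-1})^{1-p}\, h^p(\underline{\mathbf{x}}_{k-1})\, d\underline{\mu}^{k-1} = A_p'(h).$$

□

As a consequence, $h$ has rapidly decreasing projections if $A_p'(h) < \infty$ for $p = 2, 4$ for some
$\kappa$ (this condition clearly implies that $h$ is itself a rapidly decreasing function).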

7 Asymptotic characterization of geometric U-statistics

In this section we work in the framework of Problem I2.11[ in the special case where ot\ = 1 for 
every $\lambda$. As before, $h \in L^{1,2}(Z^k)$ is such that each $F_\lambda$ defined by (2.12) is a square-integrable
U-statistic. In the terminology of [25], each $F_\lambda$ is a geometric U-statistic. In what follows, we shall
use the theory of Hoeffding decompositions (see e.g. Vitale [33]) in order to provide a complete
characterization of the asymptotic behavior of the family $F_\lambda$, as $\lambda \to \infty$, yielding a substantial
generalization of the results proved in [25, Section 5]. Observe that, according to (2.11),

$$F_\lambda = E[F_\lambda] + \sum_{i=1}^k F_{i,\lambda} := E[F_\lambda] + \sum_{i=1}^k I_i(f_{i,\lambda}), \tag{7.35}$$

where

$$f_{i,\lambda}(\mathbf{x}_i) = \lambda^{k-i} \binom{k}{i} \int_{Z^{k-i}} h(\mathbf{x}_i, \mathbf{x}_{k-i})\, d\mu^{k-i} =: \lambda^{k-i} h_i(\mathbf{x}_i), \quad \mathbf{x}_i \in Z^i, \tag{7.36}$$

and each multiple integral is realized with respect to the compensated Poisson measure $\hat{\eta}_\lambda =
\eta_\lambda - \lambda\mu$.

Also, since $X$ is a compact set of $\mathbb{R}^d$ and $\nu$ is a probability measure, one has that, for every
$\lambda$, $\mu_\lambda(Z) = \lambda\, \ell(X)\, \nu(M) < \infty$ and $\eta_\lambda(Z)$ is a Poisson random variable of parameter $\mu_\lambda(Z)$. In
order to state our main findings, we need the definition of a Gaussian measure. The reader is
referred e.g. to [21, Chapter 5] for an introduction to Gaussian measures and associated multiple
integrals.

Definition 7.1. (i) A Gaussian measure over $Z$ with control $\mu$ is a centered Gaussian family
of the type $G = \{G(B) : \mu(B) < \infty\}$, such that, for every $B, C$ verifying $\mu(B), \mu(C) < \infty$,

$$E[G(B)G(C)] = \mu(B \cap C).$$

(ii) Define $\hat{\mu}$ as the probability measure $\mu / \mu(Z)$. A Gaussian measure over $Z$ with control $\hat{\mu}$ is a
centered Gaussian family of the type $\hat{G} = \{\hat{G}(B) : \mu(B) < \infty\}$, such that, for every $B, C$
verifying $\mu(B), \mu(C) < \infty$,

$$E[\hat{G}(B)\hat{G}(C)] = \hat{\mu}(B \cap C).$$

Remark 7.2. Using the self-similarity properties of the Gaussian distribution, one sees immediately
that $G \overset{\text{Law}}{=} \mu(Z)^{1/2} \times \hat{G}$. In what follows, we shall maintain the distinction between the
two Gaussian measures $G$ and $\hat{G}$ in order to facilitate the connection with reference [5].

The following statement is the main result of this section. Note that Point 1 corresponds to
Theorem 5.2 in [25].

Theorem 7.3. (Asymptotic characterization of geometric U-statistics) Let $q_1 \geq 1$ be
the smallest integer such that $\|h_{q_1}\|_{L^2(\mu^{q_1})} > 0$, where the kernels $\{h_i : i = 1, \ldots, k\}$ are defined
according to (7.36). Then, as $\lambda \to \infty$,

$$\mathrm{Var}(F_\lambda) \sim \mathrm{Var}(F_{q_1,\lambda}) \sim c \lambda^{2k - q_1},$$

for some $c > 0$, and moreover:

1. If $q_1 = 1$, $\tilde{F}_\lambda$ converges in law to $N \sim \mathcal{N}(0, 1)$ with an upper bound of the order $\lambda^{-1/2}$ on
the Wasserstein distance.

2. If $q_1 \geq 2$, then $\lambda^{q_1/2 - k} F_\lambda$ converges in distribution to

$$V(q_1) := \mu(Z)^{q_1/2} \times I_{q_1}^{\hat{G}}(h_{q_1}) \overset{\text{Law}}{=} I_{q_1}^{G}(h_{q_1}), \tag{7.37}$$

where $I_{q_1}^{\hat{G}}$ (resp. $I_{q_1}^{G}$) indicates a multiple Wiener-Itô integral of order $q_1$, with respect to
a Gaussian measure $\hat{G}$ (resp. $G$) on $Z$ with control $\hat{\mu} = \mu / \mu(Z)$ (resp. $\mu$).

Remark 7.4. 1. It is a well-known fact that a non-zero multiple Wiener-Ito integral (with 
respect to a Gaussian measure) of order strictly greater than one cannot have a Gaussian 
distribution (see e.g. Janson [HJ Chapter V]). 

2. Observe that

$$E[V(q_1)^2] = q_1! \binom{k}{q_1}^2 \int_{Z^{q_1}} \left( \int_{Z^{k - q_1}} h(\mathbf{x}_{q_1}, \mathbf{x}_{k - q_1})\, \mu(d\mathbf{x}_{k - q_1}) \right)^2 \mu(d\mathbf{x}_{q_1}),$$

which is consistent with the estimates on the variance contained in [25, Section 5].

3. The content of Theorem 7.3 allows one to give a complete explanation of a counterexample
provided in [25, end of Section 5.1]. Consider indeed a Poisson process $\eta_\lambda$ on $Z = X =
[-1, 1]$, with control measure given by $\lambda\, dx$. Define the kernel $f(x_1, x_2)$ on $Z^2$ as follows:
$f(x_1, x_2) = 1$, if $x_1 x_2 \geq 0$ and $f(x_1, x_2) = -1$, if $x_1 x_2 < 0$. Then, the random variable
$F_\lambda = \sum_{(x_1, x_2) \in \eta_{\lambda, \neq}^2} f(x_1, x_2)$ is a U-statistic of order 2 such that $q_1 = 2$, and Part 2 of
Theorem 7.3 implies that $\lambda^{-1} F_\lambda$ converges in distribution to $V(2) = I_2^G(f)$, where $G$ is a
Gaussian measure on $Z$ with control equal to the Lebesgue measure. Standard results on
multiple stochastic integrals imply that $V(2)$ is a non-Gaussian random variable having
the same law as

$$\xi_1^2 - 1 + \xi_2^2 - 1 - 2\xi_1 \xi_2 \overset{\text{Law}}{=} 2(\xi_1^2 - 1),$$

where $(\xi_1, \xi_2)$ is a two-dimensional vector of i.i.d. centered Gaussian random variables
with unit variance. In particular, one has that $E[V(2)^2] = 8$ and $E[V(2)^3] = 64$.
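
The two moments quoted in the counterexample can be recovered by a direct computation. The sketch below assumes the representation $V(2) \overset{\text{Law}}{=} 2(\xi^2 - 1)$ with $\xi$ standard Gaussian, and uses the classical formula $E[\xi^{2n}] = (2n - 1)!!$ together with a binomial expansion. The function names are hypothetical.

```python
from math import comb

def gaussian_even_moment(n):
    """E[xi^(2n)] = (2n - 1)!! for a standard Gaussian xi (equal to 1 when n = 0)."""
    result = 1
    for odd in range(1, 2 * n, 2):
        result *= odd
    return result

def moment_of_v2(order):
    """E[(2 (xi^2 - 1))^order], expanding (xi^2 - 1)^order with the binomial formula."""
    expansion = sum(
        comb(order, j) * (-1) ** (order - j) * gaussian_even_moment(j)
        for j in range(order + 1)
    )
    return 2 ** order * expansion

print(moment_of_v2(1), moment_of_v2(2), moment_of_v2(3))
```

A non-zero third moment for a centered variable confirms that the limit is not Gaussian, in line with Point 1 of the remark.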

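Proof of Theorem 7.3. For each $1 \leq j \leq k$,

$$\mathrm{Var}(F_{j,\lambda}) = j!\, \|f_{j,\lambda}\|^2_{L^2(Z^j; \mu_\lambda^j)} = c_j \lambda^{2k - j},$$

for some constant $c_j \geq 0$ (independent of $\lambda$) such that $c_{q_1} > 0$. By orthogonality,

$$\mathrm{Var}(F_\lambda) = \lambda^{2k} \sum_{j = q_1}^{k} c_j \lambda^{-j} = c_{q_1} \lambda^{2k - q_1} \Big( 1 + \sum_{j = q_1 + 1}^{k} \frac{c_j}{c_{q_1}} \lambda^{q_1 - j} \Big) = c_{q_1} \lambda^{2k - q_1} \big( 1 + O(\lambda^{-1}) \big).$$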
j=i qi +i 



It follows that, as A — > 00 and writing F qi \ = F qii \/ i v /Var(F giiA ), 

Var(F A - F qi>x ) x £ Var (j^fc) < C,X~\ 
j><ji v 7 

for some finite constant C\ independent of A. 
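1. Assume $h_1 \neq 0$ and write $\tilde{f}_1 := f_{1,\lambda} / \|f_{1,\lambda}\|_{L^2(Z; \mu_\lambda)}$. We can use Theorem 2.4 in the case $k = 1$
and $q_1 = 1$ (together with (4.19)), in order to deduce the bound

$$d_W(\tilde{F}_{1,\lambda}, N) \leq \|\tilde{f}_1\|^3_{L^3(Z; \mu_\lambda)} = \frac{\lambda^{1 + 3(k-1)}\, \|h_1\|^3_{L^3(Z; \mu)}}{\lambda^{(3/2)(2k-1)}\, \|h_1\|^3_{L^2(Z; \mu)}} = C_2 \lambda^{3k - 2 - (3/2)(2k - 1)} = C_2 \lambda^{-1/2},$$

for some constant $C_2$ independent of $\lambda$. The required estimate follows from the standard
inequality $d_W(\tilde{F}_\lambda, N) \leq d_W(\tilde{F}_{1,\lambda}, N) + \mathrm{Var}(\tilde{F}_\lambda - \tilde{F}_{1,\lambda})^{1/2}$, $N \sim \mathcal{N}(0, 1)$, as well as from
the estimates contained in the first part of the proof.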

2. For every integer $M \geq 1$, we write $[M] = \{1, \ldots, M\}$. Assume that $q_1 > 1$, implying that,
for every $i < q_1$,

$$\int_{Z^{k-i}} h(\mathbf{x}_i, \mathbf{x}_{k-i})\, \mu^{k-i}(d\mathbf{x}_{k-i}) = 0,$$

almost everywhere $d\mu^i$. Let $\{x_1, \ldots, x_{\eta_\lambda(Z)}\}$ be any enumeration of the support of $\eta_\lambda$ inside
$Z$, and observe that

$$F_\lambda = k! \sum_{\{i_1, \ldots, i_k\} \subset [\eta_\lambda(Z)]} h(x_{i_1}, \ldots, x_{i_k}).$$

The standard theory of Hoeffding decompositions (see e.g. Vitale [33]) implies that $F_\lambda$
admits a unique decomposition of the form

k 



i=( ?i 0'l,-Ji}C[r?A(^)] 

where = hq 1 (g 1 ) fJ>(Z) qi ~ k , and each .ffj is a degenerate kernel in i variables - in 
the sense that J z iJj(xj_i, x)fJ,(dx) = 0, almost everywhere dfi* . The strong law of large 
numbers (which is a consequence of the infinite divisibility of the Poisson distribution) 
implies that, for every i = qi, k, 



A fc -* ^ v ; (fc-i)!' 

in probability, as A —¥ 00. Using Theorem 2], we also deduce that, for every i = qi, k, 
the class 

Hi{x h ,...,x k ), A>0, 

{h,-,ji}C[vx(Z)] 

converges to 

and the conclusion follows immediately. 

□ 
8 Applications

The methods developed in the present article and in [10] are well tailored for dealing with spatial
processes enjoying some translation-invariance properties. We present below some applications.
We first consider the total mass of weighted edges of the random graph based on a fundamental
object of stochastic geometry: the boolean model. Then, we show how our results can be
applied to the study of a random telecommunication network, where devices are placed at
random locations and have random radii of interaction.

On the boolean model we consider a classical second-order U-statistic, but the coverage of the
random telecommunication network is quantified in terms of a $k$-th order U-statistic, for $k \geq 1$.

Throughout this section, we use the notation

$$X_\lambda = [-\lambda^{1/d}, \lambda^{1/d}]^d,$$

where $d \geq 1$ is some integer.

8.1 Random graph defined on a boolean model

The standard boolean model consists of i.i.d. random compact sets (grains) placed at random
locations (germs) in space. The union of such sets stands out as one of the most used and
most studied examples of a stationary random closed set of $\mathbb{R}^d$. The reference books [13, 27, 32]
provide a good presentation of the model and of its applications in epidemiology, forestry, or
image synthesis. The specificity of the graph based on the boolean model, where two germs
are connected if their respective grains intersect, is that the geometric behaviour of individuals
is random, meaning that the geometric rule of interaction is not determined in advance. Limit
theorems are discussed for instance in [6, 15], with a view to estimating parameters of the model
such as the intensity of the process or specific geometric quantities related to the
typical grain. Molchanov and Heinrich [6, Section 9] mention that central limit theorems for
second and higher order quantities can be obtained via a laborious application of their
method. We develop here a simple procedure to deal with a random variable based on pairwise
interactions of the model, which can easily be extended to higher orders.

We now proceed with a formal description. In what follows, the mark space is the class
$\mathcal{K}$ of compact subsets of $\mathbb{R}^d$, endowed with the Fell topology and the Borel $\sigma$-algebra (see [27]
for definition and properties). We shall denote by $\eta$ a marked Poisson measure with values in
$\mathbb{R}^d \times \mathcal{K}$ and control measure $\mu = \ell \otimes \nu$, where $\nu$ is a probability distribution on $\mathcal{K}$. Of actual
interest for modeling purposes is the random closed set defined by

$$F_\eta = \bigcup_{(x, C) \in \eta} (C + x),$$

called the boolean model with grain distribution $\nu$. We call typical grain a random compact set with
distribution $\nu$. Denote by $\eta' = \{x : (x, C) \in \eta \text{ for some } C \in \mathcal{K}\}$ the ground process and, for
every $x \in \eta'$, by $C = C_x$ the unique grain such that $(x, C) \in \eta$. For $\lambda > 0$, we call $\eta_\lambda$ (resp. $\eta'_\lambda$)
the restriction of $\eta$ to $Z_\lambda = X_\lambda \times \mathcal{K}$ (resp. of $\eta'$ to $X_\lambda$), and $F_{\eta_\lambda}$ the corresponding random closed set,
obtained from $F_\eta$ by removing all grains whose centers lie outside $X_\lambda$. Remark that $F_{\eta_\lambda} \neq F_\eta \cap X_\lambda$,
as $x \notin X_\lambda$ does not imply that $(C_x + x) \cap X_\lambda = \emptyset$.

In what follows, we consider the asymptotic normality of variables of the type

$$G_\lambda = \sum_{x, y \in \eta'_\lambda,\ x \neq y} \varphi(x - y)\, \mathbf{1}_{\{(x + C_x) \cap (y + C_y) \neq \emptyset\}},$$
where $\varphi$ is some even real function on $\mathbb{R}^d$ such that $\varphi^2$ is integrable on every compact set (note
that these assumptions imply that $G_\lambda$ has a finite second moment).

If for instance $\varphi(x - y) = \|x - y\|$, $G_\lambda$ is the total length of the edges of the graph where two
points of $X_\lambda$ in the ground process $\eta'_\lambda$ are connected if their respective grains touch. Notice that
the techniques used here could be used as well for a kernel $\varphi(x - y; C, C')$ where the weight for
each pair of points $\{x, y\}$ depends also on the grains $C_x$ and $C_y$ actually attached to $x$ and $y$.
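
To make the edge-length functional concrete, the following minimal sketch simulates it under simplifying assumptions that are not taken from the paper. The grains are closed balls centered at the germs, with i.i.d. radii uniform on [0, 0.5]. The germs form a unit-intensity Poisson process on $[-\lambda^{1/d}, \lambda^{1/d}]^d$. Each unordered pair is counted once. The function names `total_edge_length`, `sample_poisson` and `simulate` are hypothetical.

```python
import itertools
import math
import random


def total_edge_length(germs, radii):
    """Total Euclidean length of the edges joining germs whose ball grains intersect.

    Two closed balls B(x, r) and B(y, s) intersect iff ||x - y|| <= r + s.
    Each unordered pair {x, y} is counted once.
    """
    total = 0.0
    for (x, r), (y, s) in itertools.combinations(zip(germs, radii), 2):
        dist = math.dist(x, y)
        if dist <= r + s:
            total += dist
    return total


def sample_poisson(mean, rng):
    """Poisson(mean) sample: number of rate-1 exponential arrivals in [0, mean]."""
    count = 0
    t = rng.expovariate(1.0)
    while t <= mean:
        count += 1
        t += rng.expovariate(1.0)
    return count


def simulate(lam, d=2, seed=0):
    """One realisation of the total edge length on [-lam**(1/d), lam**(1/d)]**d.

    Assumption (illustrative only): ball grains with radii uniform on [0, 0.5].
    """
    rng = random.Random(seed)
    half = lam ** (1.0 / d)
    n = sample_poisson((2.0 * half) ** d, rng)
    germs = [tuple(rng.uniform(-half, half) for _ in range(d)) for _ in range(n)]
    radii = [rng.uniform(0.0, 0.5) for _ in range(n)]
    return total_edge_length(germs, radii)
```

Repeating `simulate` over many seeds for increasing values of `lam` gives an empirical view of the variance growth. The pairwise loop is quadratic in the number of germs, so it is only suitable for small windows.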

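The variable $G_\lambda$ is a $U$-statistic on $Z_\lambda = X_\lambda \times \mathcal{K}$ of order $k = 2$ with kernel
$h(x, y; C, C') = \varphi(x - y)\mathbf{1}_{\{(x + C) \cap (y + C') \neq \emptyset\}}$ on $Z_\lambda$. The kernel $h$ indeed belongs to $L^{1,2}(Z_\lambda^2)$ because

$$\int_{Z_\lambda^2} |h(x, y; C, C')|^p\, dx\, dy\, d\nu(C)\, d\nu(C') \le \int_{X_\lambda^2} |\varphi(x - y)|^p\, dx\, dy < \infty$$

for $p = 1, 2$. Furthermore $h$ is stationary with respect to its spatial variables, with factorization
given by

$$h(x; C_1, C_2) = \varphi(x)\mathbf{1}_{\{C_1 \cap (x + C_2) \neq \emptyset\}}.$$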

Let us introduce the probability that two independent grains centered respectively at $0$ and
$x$ have a non-empty intersection,

$$\chi_\nu(x) = P(C_1 \cap (x + C_2) \neq \emptyset), \quad x \in \mathbb{R}^d,$$

where $C_1$ and $C_2$ are two i.i.d. variables in $\mathcal{K}$ with distribution $\nu$. It can be shown in the
framework of stabilization theory (see for instance [1, Theorem 2.1]) that under reasonable decay
assumptions on $\chi_\nu$ we have

$$\lim_{\lambda \to \infty} E \lambda^{-1} G_\lambda = \int_{\mathbb{R}^d} \varphi(x) \chi_\nu(x)\, dx.$$

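Theorem 8.1. Assume that $\varphi$ and $\nu$ satisfy, for some $\varepsilon > 0$ and for some bounded strictly
positive probability density $\varkappa$,

$$\int_{\mathbb{R}^d} \varphi(x)^p \chi_\nu(x) \varkappa(x)^{1-p}\, dx < \infty \qquad (8.39)$$

for $p = 2, 4$. Then $\mathrm{Var}(G_\lambda) \sim c\lambda$ with $c > 0$, and

$$d_W(\tilde{G}_\lambda, N) \le C \lambda^{-1/2}$$

for some constants $C, c > 0$ not depending on $\lambda$.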

Proof. Let us apply directly Theorem 6.2 with the rescaling factor $\lambda^{-1/d}$ on $X_\lambda$ (recall the
formulation of Problem 2.11 as well as Example 2.12 (ii)). A condition for asymptotic normality
with a bound on the Wasserstein distance decreasing at the speed of $\lambda^{-1/2}$ is that $h$ has rapidly
decreasing projections. Using Lemma 6.3, the finiteness of (4.21) boils down to

$$\int |h(x; C, C')|^p \varkappa(x)^{1-p}\, dx < \infty, \quad p = 2, 4,$$

which is directly given by (8.39). □

Remark 8.2. Since $\chi_\nu \le 1$, a sufficient condition in order to have (8.39) is that the function
$\varphi$ converges to zero at a subexponential rate, as $\|x\| \to \infty$. The next statement contains an
example of how one can select a measure $\nu$ allowing one to deal with a possibly increasing $\varphi$.
Proposition 8.3. Assume that $\varphi(x, y) = \|x - y\|^\beta$ for some $\beta > -d/2$ and that the typical grain
is contained in a ball with random radius $R$ with distribution $\nu(dr) = C_\alpha \mathbf{1}_{\{r \geq 1\}} r^{-\alpha}\, dr$, $\alpha > 1$.
Then, if

$$\alpha > 2(\beta + d) + 1, \qquad (8.40)$$

the conclusion of Theorem 8.1 holds.

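Proof. Let $\varepsilon > 0$ be such that $\beta p + (1 - \alpha) + (d + \varepsilon)p - \varepsilon < 0$ for $p = 2, 4$ (the inequality for
$p = 4$ is sufficient), and let

$$\varkappa(x) = \frac{C_{d,\varepsilon}}{1 + \|x\|^{d + \varepsilon}}$$

with the appropriate normalisation constant $C_{d,\varepsilon}$. We have

$$\chi_\nu(x) \le P(R + R' \geq \|x\|),$$

where $R'$ is an independent copy of $R$, which gives $\chi_\nu(x) \le C\|x\|^{1 - \alpha}$ for some constant $C > 0$.
Then, (8.39) is implied by

$$\int^{+\infty} r^{\beta p}\, r^{1 - \alpha}\, r^{(d + \varepsilon)(p - 1)}\, r^{d - 1}\, dr < \infty$$

for $p = 2, 4$. □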
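
The tail bound used in the proof can be checked by hand. The computation below is a sketch that assumes the radius law reads $\nu(dr) = C_\alpha \mathbf{1}_{\{r \geq 1\}} r^{-\alpha}\, dr$. The explicit constant $2^\alpha$ is one admissible choice, valid for $r \geq 2$, and is not claimed to be the constant used by the authors.

```latex
% Normalisation: \int_1^\infty C_\alpha r^{-\alpha}\,dr = C_\alpha/(\alpha-1) = 1,
% hence C_\alpha = \alpha - 1.
\[
P(R \ge r) = (\alpha - 1)\int_r^\infty s^{-\alpha}\,ds = r^{1-\alpha},
\qquad r \ge 1 .
\]
% If R + R' >= r, then at least one of R, R' is >= r/2 (union bound):
\[
P(R + R' \ge r) \le P(R \ge r/2) + P(R' \ge r/2)
= 2\,(r/2)^{1-\alpha} = 2^{\alpha}\, r^{1-\alpha},
\qquad r \ge 2 .
\]
```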

The functional $G_\lambda$ is stabilizing, meaning it can be written as a sum of contributions over
the points of $\eta'_\lambda$, where the contribution of each point only depends on the intersection of $\eta_\lambda$ with
a random ball centered at this point (for a proper introduction and summary of results, see [9,
Chapter 4] and the references therein). Our results can be compared e.g. with the very general
Theorems 2.1 and 2.2 in [24], which imply in this specific framework the asymptotic normality of
$\lambda^{-1/2}(G_\lambda - E G_\lambda)$ if $\alpha - 1 > 150d$ (and an additional assumption on $\varphi$), giving a more restrictive
condition than (8.40). Note that, to our knowledge, there is no rate of convergence for CLTs
involving such polynomially stabilizing functionals in the literature.



8.2 Coverage of a telecommunications network with random user range

The content of this section is inspired by reference [1], where Decreusefond et al. use concepts
from algebraic topology in order to study the asymptotic interactions between devices in a
telecommunications network.

In what follows, the devices are spread according to a Poisson measure on $X_\lambda = [-\lambda^{1/d}, \lambda^{1/d}]^d$,
with intensity given by the Lebesgue measure. Two devices communicate within the network if
their distance is smaller than a constant, say $r$. To account for the good coverage of the network
by the devices, they consider, for each $k \geq 2$, the number $N_{k,\lambda}$ of $(k - 1)$-simplices, i.e. the
number of $k$-tuples $(x_1, \dots, x_k)$ inside $X_\lambda$ whose pairwise distances are smaller than $r$, meaning
that the group of $k$ devices is indeed connected. Among other findings, the authors of [3] give
closed expressions for the moments of $N_{k,\lambda}$, and derive a general CLT. One should note that in
[1] the metric of the torus is used in order to avoid edge effects. The problem can also be seen
as a generalization of subgraph counting (see Section [3]) with random radii of interaction, for
the particular graph of the $k$-simplex.
The results contained in Section 4.2 of the present paper are precisely tailored to deal with edge
effects for stationary spatial processes; thus we shall perform an analogous study by replacing
the torus distance with the Euclidean distance. We also introduce the assumption that members
of the network have a random range, i.e. we assume that to each member $x_i$ of the network
is assigned a random radius $R_i$, that the radii are independent and identically distributed, and
that a simplex of $k$ members is "connected" within the network if the ball around each member
$x_i$ contains every other member of the $k$-tuple.
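
The connectivity rule just described can be sketched as a brute-force count. The code below is an illustration under stated assumptions: points are given as coordinate tuples, balls are closed, and the function name `count_simplices` is hypothetical. It enumerates all $k$-subsets, so it is meant for small configurations only.

```python
import itertools
import math


def count_simplices(points, radii, k):
    """Count the k-subsets in which every member's ball contains all other members.

    A subset {x_1, ..., x_k} is counted when ||x_i - x_j|| <= R_i for all i, j in it.
    """
    n = len(points)
    # covers[i][j] is True when x_j lies in the closed ball of radius R_i around x_i.
    covers = [
        [math.dist(points[i], points[j]) <= radii[i] for j in range(n)]
        for i in range(n)
    ]
    count = 0
    for subset in itertools.combinations(range(n), k):
        if all(covers[i][j] for i in subset for j in subset):
            count += 1
    return count
```

With a constant radius the rule reduces to the fixed-range case, where a subset is counted when all pairwise distances are at most that radius.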

Formally, we shall consider a marked Poisson process $\eta$ over $\mathbb{R}^d \times \mathbb{R}_+$, with intensity given
by $\ell(dx) \otimes \nu$, where $\nu$ is some probability measure on $\mathbb{R}_+$. Every realization of $\eta$ is therefore a
collection of points of the type $(x_i, R_i)$, where every $R_i$ indicates a random radius with law $\nu$.
For every $\lambda$, the number $N_{k,\lambda}$ we are interested in counts the number of those $\{x_1, \dots, x_k\} \subset X_\lambda$
that are in the support of $\eta$ and such that $\|x_i - x_j\|_{\mathbb{R}^d} \le R_i$, for every $i, j = 1, \dots, k$. The
following statement provides sufficient conditions to have a standard limit behavior (variance of order
$\lambda$ and convergence to the normal law at a speed at most of $\lambda^{-1/2}$ in the Wasserstein distance).

Theorem 8.4. Denote by $F$ the tail function of the law of $R_1$, that is: $F(r) = P(R_1 > r) =
\nu((r, \infty))$. If for some $\varepsilon > 0$

$$\int_0^\infty F(r)\, r^{4d - 1 + \varepsilon}\, dr < \infty,$$

we have, for every $k \geq 2$ and as $\lambda \to \infty$,

$$\mathrm{Var}(N_{k,\lambda}) \sim c_k \lambda,$$
$$d_W(\tilde{N}_{k,\lambda}, N) \le C_k \lambda^{-1/2}$$

for some $C_k, c_k > 0$ independent of $\lambda$. Here, $\tilde{N}_{k,\lambda}$ indicates a centered and renormalized version
of $N_{k,\lambda}$, whereas $N$ is a centered Gaussian random variable with unit variance.
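
As an illustration of the integrability assumption, taking the integral over $(0, \infty)$, the condition can be read as a moment condition on the radius. The Pareto-type tail below is a hypothetical example chosen for concreteness and does not come from the paper.

```latex
% For a non-negative random variable R and q > 0:
\[
E[R^{q}] = q \int_0^\infty r^{q-1}\, P(R > r)\,dr ,
\]
% so that, with q = 4d + \varepsilon,
\[
\int_0^\infty F(r)\, r^{4d-1+\varepsilon}\,dr
= \frac{E\big[R_1^{\,4d+\varepsilon}\big]}{4d+\varepsilon} .
\]
% Hypothetical example: F(r) = \min(1, r^{-a}) with a > 4d + \varepsilon.
\[
\int_0^\infty F(r)\, r^{4d-1+\varepsilon}\,dr
= \int_0^1 r^{4d-1+\varepsilon}\,dr + \int_1^\infty r^{4d-1+\varepsilon-a}\,dr
= \frac{1}{4d+\varepsilon} + \frac{1}{a-4d-\varepsilon} .
\]
```

In this example the second integral diverges when $a \le 4d + \varepsilon$, so the assumption holds for some $\varepsilon > 0$ exactly when $a > 4d$.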

Proof. The random variable $N_{k,\lambda}$ is the $U$-statistic of order $k$ on $X_\lambda \times \mathbb{R}_+$ with symmetric
kernel

$$h\big((x_i, r_i)_{i = 1, \dots, k}\big) = \mathbf{1}_{\{\|x_i - x_j\| \le r_i;\ 1 \le i, j \le k\}}.$$

Such a kernel is clearly stationary with respect to spatial translations. Let $\varepsilon' = \frac{\varepsilon}{p - 1}$ and put
$\varkappa(x) \propto (1 + \|x\|)^{-(d + \varepsilon')}$. According to Theorem 6.2 and Lemma 6.3, a sufficient condition for
the result is



i=\ n iMi^'^-^fe-i) < oo 

/ yk — 1 



i=2 

for $p = 2, 4$. We have, setting $x_1 = 0$,

k 



i= [ nn^n^')^- 1 ) / ni {ft>Mt(=1 k \\ Xi _ xl \\ } dv( Pl )...dv(p k )dK k 



fe-1 

i=2 



^(^J r (P-m+e')+d-i F{r)dr ^ 



fe-1 
after doing a spherical change of variables for each $x_i$, whose Jacobian is smaller than $r^{d-1}$,
which concludes the proof because $(p - 1)(d + \varepsilon') + d - 1 = pd - 1 + \varepsilon \le 4d - 1 + \varepsilon$.

□ 